Abstract

If the nodes of a graph are considered to be identical barrels - featuring
different water levels - and the edges to be (locked) water-filled pipes
in between the barrels, one might consider the optimization problem of
how much the water level in a fixed barrel can be raised with no pumps
available, i.e. by opening and closing the locks in an elaborate succession.
This problem originated from the analysis of an opinion formation process
and proved to be not only sufficiently intricate in order to be of independent
interest, but also algorithmically complex. We deal with both finite
and infinite graphs as well as deterministic and random initial water levels
and find that the infinite line graph, due to its leanness, behaves much
more like a finite graph in this respect.


1 Introduction 

Imagine a plane on which rainwater is collected in identical rain barrels, some 
of which are connected through pipes (that are already water-filled). All the 
pipes feature locks that are normally closed. If a lock is opened, the contents 
of the two barrels which are connected via this pipe start to level, see Figure 1. 
If one waits long enough, the water levels in the two barrels will be exactly the 
same, namely lie at the average of the two water levels (a and b) before the 
pipe was unlocked. 

After a rainy night in which all of the barrels accumulated a certain amount 
of precipitation we might be interested in maximizing the water level in one 
fixed barrel by opening and closing some of the locks in carefully chosen order. 

In order to mathematically model the setting, consider an undirected graph
G = (V,E), which is either finite or infinite with bounded maximal degree.
Furthermore, we can assume without loss of generality that G is connected
and simple, that means having neither loops nor multiple edges. Every vertex is
understood to represent one of the barrels and the pipes correspond to the edges
in the graph. The barrels themselves are considered to be identical, having a
fixed capacity C > 0.

Figure 1: Levelling water stages after just having opened a lock.

Given some initial profile $\{\eta_0(u)\}_{u\in V} \in [0,C]^V$, the system is considered to
evolve in discrete time and in each round we can open one of the locked pipes and
transport water from the fuller barrel into the emptier one. If we stop early, the
two levels might not have completely balanced out, giving rise to the following
update rule for the water profile: If in round $k$ the pipe $e = (x, y)$ connecting
the two barrels at sites $x$ and $y$, with levels $\eta_{k-1}(x) = a$ and $\eta_{k-1}(y) = b$
respectively, is opened and closed after a certain period of time, we get

$$\begin{aligned} \eta_k(x) &= a + \mu_k\,(b-a),\\ \eta_k(y) &= b + \mu_k\,(a-b) \end{aligned} \tag{1}$$

for some $\mu_k \in [0,\tfrac12]$, which we assume can be chosen freely by appropriate
choice of how long the pipe is left open. All other levels stay unchanged, i.e.
$\eta_k(w) = \eta_{k-1}(w)$ for all $w \in V \setminus \{x,y\}$.
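As a minimal illustration of the update rule (1), the following Python sketch applies one round to a profile stored as a dictionary. The function name `open_pipe` and the dictionary representation are assumptions made for this illustration only; they are not part of the model's notation.

```python
def open_pipe(eta, x, y, mu):
    """Apply update rule (1) to the pipe (x, y) with parameter mu in [0, 1/2].

    eta maps each vertex to its water level. A new profile is returned;
    the levels at all vertices other than x and y are left unchanged.
    """
    if not 0.0 <= mu <= 0.5:
        raise ValueError("mu must lie in [0, 1/2]")
    a, b = eta[x], eta[y]
    new_eta = dict(eta)
    new_eta[x] = a + mu * (b - a)
    new_eta[y] = b + mu * (a - b)
    return new_eta


# Hypothetical three-barrel profile; only the pipe (x, y) is opened.
profile = {"x": 0.75, "y": 0.25, "w": 0.5}
print(open_pipe(profile, "x", "y", 0.5))   # both levels meet at the average
print(open_pipe(profile, "x", "y", 0.25))  # pipe closed early
```

The choice mu = 1/2 corresponds to waiting until the two levels have balanced out completely, while mu = 0 leaves the profile untouched.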

The quantity of interest is then defined as follows: 

Definition 1 

For a graph $G = (V,E)$, an initial water profile $\{\eta_0(u)\}_{u\in V}$ and a fixed vertex
$v \in V$ (the target vertex), let a move sequence be given by a list of edges and
time spans that determines which pipes are opened (in chronological order) and
for how long. Let then $\kappa(v)$ be defined as the supremum over all water levels
that are achievable at $v$ with move sequences consisting of finitely many rounds,
i.e.

$$\kappa(v) := \sup\{r \in \mathbb{R},\ \text{there exists } T \in \mathbb{N}_0 \text{ and a move sequence s.t. } \eta_T(v) = r\}.$$

Readers familiar with mathematical models for social interaction processes
might note that (1) basically looks like the update rule in the opinion formation
process given by the so-called Deffuant model for consensus formation in social
networks (as described in the introduction of [5]), only $\mu$ can change from update
to update and the bounded confidence restriction is omitted. This however is no
coincidence: The situation described in the context above arises naturally in the
analysis of the Deffuant model, where the question is how extreme an opinion
a fixed agent can possibly get given an initial opinion profile on a specified network
graph, if the interactions take place appropriately.

In order to tackle this question, Häggström [4] invented a non-random pairwise
averaging procedure, which he proposed to call Sharing a drink (SAD).
This procedure - which is the main focus of the second section - was originally
considered on the infinite line graph only, i.e. the graph $G = (V, E)$ with $V = \mathbb{Z}$
and $E = \{(v,v+1),\ v \in \mathbb{Z}\}$, but can immediately be generalized to any graph
(see Definition 2) and is dual to the water transport described above in a sense
to be made precise in Lemma 2.1.

In Section 3, we will deal with the water transport problem on finite graphs. 
After formally introducing the idea of optimal move sequences, we investigate 
both their essential building blocks and the effect of simultaneously opened 
pipes. In Subsection 3.3, being a collection of examples, we will in fact deal
with both situations - the one in which we consider the initial water levels to be
deterministic and the other in which they are random. In the latter case $\kappa(v)$
obviously becomes a random variable as well since it strongly depends on the
initial profile. On non-transitive graphs (see Definition 8) its distribution can
moreover depend on the chosen vertex $v$ - even for i.i.d. initial water levels, see
Example 3.2.

In the fourth section, we extend the complexity consideration touched upon 
in some of the examples from Section 3. We show that it is an NP-hard problem 
to determine $\kappa(v)$ for a given finite graph, target vertex $v$ and initial water profile
in general, something that might be considered as a valid excuse for the fact 
that we are unable to give a neat general solution when it comes to optimal 
move sequences in the water transport problem on finite graphs, as dealt with 
in Section 3. 

As opposed to the two preceding sections, Section 5 is devoted to infinite
graphs. We consider i.i.d. initial water levels (with a non-degenerate marginal
distribution) and detect a remarkable change of behavior: On the infinite line
graph, the highest achievable water level at a fixed vertex depends on the initial
profile in the sense that it has a non-degenerate distribution, just like on any
finite graph. If the infinite graph contains a neighbor-rich half-line (see
Definition 7), however, this dependence becomes degenerate: For any vertex $v \in V$,
the value $\kappa(v)$ almost surely equals the essential supremum of the marginal
distribution. This fact makes the infinite line graph quite unique: It constitutes
the only exception among all infinite quasi-transitive graphs, to the effect that
$\kappa(v)$ is a non-degenerate random variable - an observation which is captured in
the last theorem: the nonetheless central Theorem 5.3.
2 Connection to the SAD-procedure

Let us first repeat the formal definition of the SAD-procedure:

Definition 2

For a graph $G = (V,E)$ and some fixed vertex $v \in V$, define $\{\xi_0(u)\}_{u\in V}$ by
setting

$$\xi_0(u) = \delta_v(u) := \begin{cases} 1 & \text{for } u = v\\ 0 & \text{for } u \neq v. \end{cases}$$

In each time step, an edge $(x,y)$ is chosen and the profile updated
according to the rule (1) with $\{\xi_k(u)\}_{u\in V}$ in place of $\{\eta_k(u)\}_{u\in V}$. One can
interpret this procedure as a full glass of water initially placed at vertex $v$ (all
other glasses being empty), which is then repeatedly shared among neighboring
vertices by each time step choosing a pair of neighbors and pouring a $\mu_k$-fraction
of the difference from the glass containing more water into the one containing
less. Let us refer to this interaction process as Sharing a drink (SAD).


Just as in [4], the SAD-procedure can be used to describe the composition 
of the contents in the water barrels after finitely many rounds of opening and 
closing pipe locks. The following lemma corresponds to La. 3.1 in [4], but since 
the two dual processes (water transport and SAD) evolve in discrete time in our 
setting, the proof simplifies somewhat. 

Lemma 2.1 

Consider an initial profile of water levels $\{\eta_0(u)\}_{u\in V}$ on a graph $G = (V, E)$
and fix a vertex $v \in V$. For $T \in \mathbb{N}_0$ define the SAD-procedure that starts with
$\xi_0(u) = \delta_v(u)$ (see Definition 2) and is dual to the chosen move sequence in the
water transport problem in the following sense: If in round $k \in \{1,\dots,T\}$ the
water profile is updated according to (1), the update in the SAD-profile at time
$T - k \in \{0,\dots,T-1\}$ takes place along the same edge and with the same choice
of $\mu_k$. Then we get

$$\eta_T(v) = \sum_{u\in V} \xi_T(u)\,\eta_0(u). \tag{2}$$


Proof: We prove the statement by induction on $T$. For $T = 0$, the statement
is trivial and there is nothing to show. For the induction step fix $T \in \mathbb{N}$ and
assume the first pipe opened to be $e = (x,y)$. According to (1) we get

$$\eta_1(u) = \begin{cases} \eta_0(u) & \text{if } u \notin \{x,y\}\\ (1-\mu_1)\,\eta_0(x) + \mu_1\,\eta_0(y) & \text{if } u = x\\ (1-\mu_1)\,\eta_0(y) + \mu_1\,\eta_0(x) & \text{if } u = y. \end{cases}$$

Let us consider $\{\eta_1(u)\}_{u\in V}$ as some initial profile $\{\eta'_0(u)\}_{u\in V}$. By induction
hypothesis we get

$$\begin{aligned} \eta'_{T-1}(v) &= \sum_{u\in V} \xi'_{T-1}(u)\,\eta'_0(u)\\ &= \sum_{u\in V\setminus\{x,y\}} \xi'_{T-1}(u)\,\eta_0(u) + \big((1-\mu_1)\,\xi'_{T-1}(x) + \mu_1\,\xi'_{T-1}(y)\big)\,\eta_0(x)\\ &\quad + \big((1-\mu_1)\,\xi'_{T-1}(y) + \mu_1\,\xi'_{T-1}(x)\big)\,\eta_0(y), \end{aligned}$$

where $\eta'_{T-1}(v) = \eta_T(v)$ and $\{\xi'_t(u)\}_{u\in V}$, $0 \le t \le T-1$, is the SAD-procedure
corresponding to the move sequence after round 1. As by definition the
SAD-procedure $\xi$ arises from $\xi'$ by adding an update at time $T-1$ along edge $e$ with
parameter $\mu_1$, we find $\xi_k(u) = \xi'_k(u)$ for all $k \in \{0,\dots,T-1\}$ and $u \in V$ as well
as

$$\xi_T(u) = \begin{cases} \xi'_{T-1}(u) & \text{if } u \notin \{x,y\}\\ (1-\mu_1)\,\xi'_{T-1}(x) + \mu_1\,\xi'_{T-1}(y) & \text{if } u = x\\ (1-\mu_1)\,\xi'_{T-1}(y) + \mu_1\,\xi'_{T-1}(x) & \text{if } u = y, \end{cases}$$

which establishes the claim. □
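The duality (2) can be checked numerically on a small instance. The sketch below is an illustration under assumptions: the path graph, the initial levels and the move list are hypothetical, and the helper names are made up for this example. The SAD-procedure applies the same moves in reversed order, as prescribed in Lemma 2.1; exact rational arithmetic is used so that both sides of (2) can be compared with `==`.

```python
from fractions import Fraction


def update(profile, edge, mu):
    """Rule (1) on edge (x, y): each endpoint moves a mu-fraction towards the other."""
    x, y = edge
    a, b = profile[x], profile[y]
    new = dict(profile)
    new[x] = a + mu * (b - a)
    new[y] = b + mu * (a - b)
    return new


def water_level(eta0, moves, v):
    """Run the water transport moves in chronological order and return eta_T(v)."""
    eta = dict(eta0)
    for edge, mu in moves:
        eta = update(eta, edge, mu)
    return eta[v]


def dual_sad_profile(vertices, moves, v):
    """Start with a full glass at v and apply the same moves in reversed order."""
    xi = {u: Fraction(int(u == v)) for u in vertices}
    for edge, mu in reversed(moves):
        xi = update(xi, edge, mu)
    return xi


# Hypothetical example: path 1 - 2 - 3 - 4 with target vertex 2.
eta0 = {1: Fraction(3, 10), 2: Fraction(1, 10), 3: Fraction(9, 10), 4: Fraction(1, 2)}
moves = [((3, 4), Fraction(1, 2)), ((2, 3), Fraction(1, 4)), ((1, 2), Fraction(1, 2))]

xi_T = dual_sad_profile(list(eta0), moves, v=2)
lhs = water_level(eta0, moves, v=2)
rhs = sum(xi_T[u] * eta0[u] for u in eta0)
print(lhs, rhs)  # both sides of (2)
```

Once the move sequence is fixed, the SAD-profile does not depend on the initial water levels, which is what makes it a convenient bookkeeping device for the composition of the water at the target vertex.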

In the following sections, we want to consider not only deterministic but also 
random initial profiles of water levels. Having this mindset already, it might be 
useful to halt for a moment and realize that the statement of Lemma 2.1 deals 
with a deterministic duality that does not involve any randomness (once the 
initial profile and the move sequence are fixed). 

Before we turn to the task of raising water levels, let us prepare two more
auxiliary results. The first one follows directly from the energy argument that 
was used in the proof of Thm. 2.3 in [4]: 

Lemma 2.2 

Given an initial profile of water levels $\{\eta_0(u)\}_{u\in V}$ on a graph $G = (V,E)$, fix
a finite set $A \subseteq V$ and a set $E_A \subseteq E$ of edges inside $A$ that connects $A$. If
we open the pipes in $E_A$ - and no others - in repetitive sweeps for times long
enough such that $\mu_k \ge \varepsilon$ for some fixed $\varepsilon > 0$ in each round (cf. (1)), then
the water levels inside the set $A$ approach a balanced average, i.e. converge to
the value $\frac{1}{|A|}\sum_{u\in A}\eta_0(u)$. The corresponding dual SAD-profiles started with
$\xi_0(u) = \delta_v(u)$, $u \in V$, converge uniformly to $\frac{1}{|A|}\,\delta_A$ (i.e. to $\frac{1}{|A|}$ on $A$ and $0$
elsewhere) for all $v \in A$.

Proof: Let us define the energy after round $k$ inside $A$ by

$$W_k(A) = \sum_{v\in A} \big(\eta_k(v)\big)^2.$$

A short calculation reveals that an update of the form (1) reduces the energy
by $2\mu_k(1-\mu_k)\,(b-a)^2$, where the updated water levels were $a$ and $b$ respectively. If
$\mu_k$ is bounded away from 0, the fact that $W_k(A) \ge 0$ for all $k$ entails that the
difference in water levels $|b-a|$ before a pipe is opened can be larger than any
fixed positive value only finitely many times. In effect, since any pipe in $E_A$ is
opened repetitively we must have $|\eta_k(u) - \eta_k(v)| \to 0$ as $k \to \infty$ for all edges
$(u,v) \in E_A$. As the updates are average preserving, the first part of the claim
follows from the fact that $E_A$ connects $A$.

The second part of the lemma follows by applying the same argument to the
dual SAD-procedure. □
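The energy argument can be observed numerically. The following sketch is only an illustration: the set A, the edge set E_A, the initial levels and the choice mu = 0.3 are hypothetical. It opens the pipes of E_A in repeated sweeps, records the energy after each sweep and measures how far the levels still are from the average over A.

```python
def sweep(levels, edges, mu=0.5):
    """Open each pipe in `edges` once, in the given order, with parameter mu."""
    for x, y in edges:
        a, b = levels[x], levels[y]
        levels[x] = a + mu * (b - a)
        levels[y] = b + mu * (a - b)


def energy(levels, A):
    """Energy W(A): sum of squared water levels inside A."""
    return sum(levels[u] ** 2 for u in A)


# Hypothetical set A = {0, 1, 2, 3}, connected by the path edges E_A.
A = [0, 1, 2, 3]
E_A = [(0, 1), (1, 2), (2, 3)]
levels = {0: 1.0, 1: 0.0, 2: 0.5, 3: 0.1}
average = sum(levels[u] for u in A) / len(A)

energies = [energy(levels, A)]
for _ in range(200):
    sweep(levels, E_A, mu=0.3)
    energies.append(energy(levels, A))

max_gap = max(abs(levels[u] - average) for u in A)
print(average, max_gap)
```

In this run the recorded energies never increase and the levels end up numerically indistinguishable from the average, in line with the statement of the lemma.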

The following lemma constitutes an extremely narrowed variant of Thm. 2.3 
in [4] which applies to graphs more complex than line graphs as well and will 
come in useful in Example 3.3: 

Lemma 2.3 

Fix a (connected) graph $G = (V, E)$ and a vertex $v \in V$. For any $w \in V \setminus \{v\}$,
the supremum of $\xi_k(w)$ taken over all times $k$ and SAD-procedures started with
$\xi_0(u) = \delta_v(u)$, $u \in V$, is less than or equal to $\tfrac12$.

Proof: If the SAD-procedure is started with a full glass of water at $v \neq w$,
the assumption that the amount at $w$ can rise above $\tfrac12$ leads to the following
contradiction: Assume $k$ to be the first time s.t. $\xi_k(w) > \tfrac12$. Then in round $k$
node $w$ necessarily pooled the water with some neighbor $u$, that had more water
than $w$. But since this relation is preserved by an update, it implies

$$\xi_k(w) + \xi_k(u) \ge 2\,\xi_k(w) > 1,$$

which is impossible as the amount of water shared always sums to 1. □

To round off these preliminary considerations, let us collect some results 
about SAD-profiles from [4] - partly already mentioned - into a single lemma 
for convenience. 

Lemma 2.4 

Consider the SAD-procedure on a line graph, started in vertex $v$, i.e. with
$\xi_0(u) = \delta_v(u)$, $u \in V$.

(a) The SAD-profiles achievable on line graphs are all unimodal.

(b) If the vertex $v$ only shares the water to one side, it will remain a mode of
the SAD-profile.

(c) The supremum over all achievable SAD-profiles started with $\delta_v$ at another
vertex $w$ equals $\frac{1}{d+1}$, where $d$ is the graph distance between $v$ and $w$.

The results in [4] actually all deal with the infinite line graph, but it is
evident how the arguments used immediately transfer to finite line graphs. Part
(a) hereby corresponds to La. 2.2 in [4], part (b) to La. 2.1 and part (c) to
Thm. 2.3. The argument Häggström [4] used to prove the statement in (c) for
the infinite line graph can in fact be generalized to prove the result for trees
without much effort, as was done by Shang (see Prop. 6 in [7]).




In fact, we believe that not only the cut-back statement from Lemma 2.3 but
also the natural generalization of Thm. 2.3 in [4] holds true for general graphs.
Our attempts to prove the generalization to non-tree graphs have, however,
turned out to be unsuccessful.


3 Water transport on finite graphs 

In this section, we consider the underlying network to be finite, i.e. $|V| = n \in \mathbb{N}$.
In order to increase the water level at our fixed site $v$ one could in principle start
by greedily trying to connect the barrels with the highest water levels to the one
at $v$. However, optimizing this idea is far from being trivial. Let us first define
optimal move sequences and then reveal some properties and building blocks 
that they share. 

Definition 3 

For fixed $v \in V$ and a given initial water profile $\{\eta_0(u)\}_{u\in V}$ let $\varphi \in (E \times [0,\tfrac12])^T$,
where $\varphi_k = (e_k,\mu_k)$, be called a finite move sequence if $T \in \mathbb{N}_0$. $\varphi$ is a finite
optimal move sequence if opening the pipes $e_1,\dots,e_T$ in chronological order,
each for the period of time that corresponds to $\mu_k$ in (1), will lead to the final
value $\eta_T(v) = \kappa(v)$.

For any move sequence $\varphi \in (E \times [0,\tfrac12])^T$, we will denote by $\{\xi_T(u)\}_{u\in V}$ the
SAD-profile that corresponds to $\varphi$ via the duality laid down in Lemma 2.1.

If no finite optimal move sequence exists, let us call $\Phi = \{\varphi^{(m)},\ m \in \mathbb{N}\}$
an infinite type optimal move sequence, provided that $\varphi^{(m)} \in (E \times [0,\tfrac12])^{T_m}$ is
a finite move sequence for each $m \in \mathbb{N}$, achieving $\eta_{T_m}(v) > \kappa(v) - \tfrac1m$, and the
SAD-profiles $\{\xi_{T_m}(u)\}_{u\in V}$ dual to $\varphi^{(m)}$ converge pointwise to a limit $\{\xi(u)\}_{u\in V}$
as $m \to \infty$.

It is tempting to assume that in the case where no finite optimal move
sequence exists, we could get away with an infinite move sequence instead of a
sequence of finite move sequences $\Phi$ as described above. However this is not the
case, see Example 3.6.

Lemma 3.1 

Take the network $G = (V, E)$ to be finite, and fix the target vertex $v$ as well
as the initial water profile. Then the existence of an optimal move sequence is
guaranteed and the following simplification will not change its performance: In
an optimal move sequence, without loss of generality we can assume $\mu_k = \tfrac12$ for
all $k$.

Proof: By the very definition of $\kappa(v)$, the existence of optimal move sequences
(however not necessarily finite ones) is guaranteed: Let $A \subseteq [0,1]^V$ denote the
set of achievable SAD-profiles. Its closure $\overline{A}$ in $([0,1]^V, \|\cdot\|_2)$ is bounded and
therefore compact. Given the initial water profile $\{\eta_0(u)\}_{u\in V}$, the function

$$f : [0,1]^V \to [0,C], \qquad \{\xi(u)\}_{u\in V} \mapsto \sum_{u\in V} \xi(u)\,\eta_0(u)$$

is continuous. Hence there exists a closed subset $F$ of $\overline{A}$ on which $f$ achieves its
maximum $\kappa(v)$ over $\overline{A}$. The SAD-profiles dual to finite optimal move sequences
are given by $F \cap A$. If $F \cap A = \emptyset$ and $\Phi = \{\varphi^{(m)},\ m \in \mathbb{N}\}$ is a collection of
finite move sequences s.t. $\varphi^{(m)} \in (E \times [0,\tfrac12])^{T_m}$ and $\eta_{T_m}(v) > \kappa(v) - \tfrac1m$ for all
$m \in \mathbb{N}$, we can assume without loss of generality that the corresponding
SAD-profiles $\{\xi_{T_m}(u)\}_{u\in V}$ have a limit $\{\xi(u)\}_{u\in V}$ (by passing on to a subsequence
if necessary) as $\overline{A}$ is compact. This turns $\Phi$ into an infinite type optimal move
sequence and the limit of its dual SAD-profiles necessarily lies in $F$.

Assume now that the first move in a sequence $\varphi \in (E \times [0,\tfrac12])^T$ is to open
the lock on pipe $e_1 = (x,y)$ for a time corresponding to $\mu_1 \in [0,\tfrac12]$ in (1).
Without loss of generality we can assume $\eta_0(x) \ge \eta_0(y)$ (which in turn implies
$\eta_1(x) \ge \eta_1(y)$). If we look at the SAD-profile $\{\xi'_{T-1}(u)\}_{u\in V}$ corresponding
to $\varphi' := (\varphi_2,\dots,\varphi_T) \in (E \times [0,\tfrac12])^{T-1}$ - in effect we look at the outcome
of the move sequence after the first step applied to the new initial water profile
$\{\eta_1(u)\}_{u\in V}$ - we can distinguish two cases: either $\xi'_{T-1}(x) \ge \xi'_{T-1}(y)$ or
$\xi'_{T-1}(x) < \xi'_{T-1}(y)$. In the first case, changing $\mu_1$ to 0, i.e. erasing the first move,
will not decrease the water level finally achieved at $v$, see (2). In the second
case the same holds for changing $\mu_1$ to $\tfrac12$. Since we can consider any step in
the move sequence to be the first one applied to the intermediate water profile
achieved so far, this establishes the claim for finite optimal move sequences.

As any finite move sequence can be simplified in this way without worsening
its outcome, the argument applies to the elements of a sequence $\{\varphi^{(m)},\ m \in \mathbb{N}\}$
of finite move sequences and thus to infinite type optimal move sequences
as well. □
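The case distinction in the proof can be made explicit by a short computation. As a sketch, write $\xi'_{T-1}$ for the SAD-profile dual to the moves after round 1, apply (2) to the profile $\eta_1$ and insert the update rule (1) for the first pipe $(x,y)$:

```latex
\eta_T(v) \;=\; \sum_{u\in V} \xi'_{T-1}(u)\,\eta_1(u)
\;=\; \sum_{u\in V} \xi'_{T-1}(u)\,\eta_0(u)
\;+\; \mu_1\,\big(\eta_0(x)-\eta_0(y)\big)\,\big(\xi'_{T-1}(y)-\xi'_{T-1}(x)\big).
```

The final level is therefore an affine function of $\mu_1$, so over $[0,\tfrac12]$ it is maximized at an endpoint: at $\mu_1 = 0$ if the product of the two differences is non-positive and at $\mu_1 = \tfrac12$ otherwise.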

3.1 Macro moves 

When it comes to the opening and closing of pipes, it is not self-evident how far 
things change if we allow pipes to be opened simultaneously. First of all one has 
to properly extend the model laid down in (1) by specifying how the water levels 
behave when more than two barrels are connected at the same time. In order 
to keep things simple, let us assume that the pipes are all short enough and of 
sufficient diameter such that we can neglect all kinds of flow effects. Moreover, 
let us take the dynamics to be as crude as can be by assuming that the water 
levels of the involved barrels approach their common average in a linear and 
proportional fashion, which is made more precise in the following definition. 

Definition 4 

Given a graph $G = (V, E)$, let $A \subseteq V$ be a set of at least 3 nodes and $E_A \subseteq E$ a
set of edges inside $A$ that connects $A$. A macro move on $E_A$ (or simply $A$) will
denote the action of opening all pipes that correspond to edges in $E_A$ in some
round $k$ simultaneously and will - analogously to (1) - change the water levels
for all vertices $u \in A$ to

$$\eta_k(u) = (1-2\mu_k)\,\eta_{k-1}(u) + 2\mu_k\,\overline{\eta}_{k-1}(A), \quad\text{where}\quad \overline{\eta}_{k-1}(A) = \frac{1}{|A|}\sum_{w\in A} \eta_{k-1}(w)$$

is the average over the set $A$ after round $k-1$ and $\mu_k \in [0,\tfrac12]$.
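A macro move in the sense of Definition 4 can be sketched in a few lines of Python. The function name `macro_move` and the example profile are assumptions made for illustration; the sketch takes the vertex set A as given and does not verify that A is connected by pipes.

```python
def macro_move(eta, A, mu):
    """Macro move on the vertex set A with parameter mu in [0, 1/2].

    Every level inside A moves a 2*mu-fraction of the way towards the
    current average over A; levels outside A stay unchanged.
    """
    if not 0.0 <= mu <= 0.5:
        raise ValueError("mu must lie in [0, 1/2]")
    avg = sum(eta[u] for u in A) / len(A)
    new_eta = dict(eta)
    for u in A:
        new_eta[u] = (1 - 2 * mu) * eta[u] + 2 * mu * avg
    return new_eta


# Hypothetical profile on four barrels; the macro move involves a, b and c.
profile = {"a": 1.0, "b": 0.25, "c": 0.25, "d": 0.75}
full = macro_move(profile, ["a", "b", "c"], 0.5)    # complete levelling on A
half = macro_move(profile, ["a", "b", "c"], 0.25)   # halfway towards the average
print(full)
print(half)
```

For a set A of two vertices the formula reduces to the single-pipe rule (1), which is why the parameter range is kept at [0, 1/2].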

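First of all, Lemma 2.1 transfers immediately and almost verbatim to move
sequences including macro moves: In a move sequence with a macro move on
the set $A$ in the first round, we get the water levels

$$\eta_1(u) = \begin{cases} \eta_0(u) & \text{if } u \notin A\\ (1-2\mu_1)\,\eta_0(u) + \frac{2\mu_1}{|A|}\sum_{w\in A}\eta_0(w) & \text{if } u \in A. \end{cases}$$

If $\{\xi_{T-1}(u),\ u \in V\}$ and $\{\xi_T(u),\ u \in V\}$ are such that

$$\eta_T(v) = \sum_{u\in V} \xi_T(u)\,\eta_0(u) = \sum_{u\in V} \xi_{T-1}(u)\,\eta_1(u),$$

we find by comparing the coefficient of $\eta_0(u)$

$$\xi_T(u) = \begin{cases} \xi_{T-1}(u) & \text{if } u \notin A\\ (1-2\mu_1)\,\xi_{T-1}(u) + \frac{2\mu_1}{|A|}\sum_{w\in A}\xi_{T-1}(w) & \text{if } u \in A, \end{cases}$$

which is the SAD-profile originating from the very same macro move applied to
$\{\xi_{T-1}(u),\ u \in V\}$. With this tool in hand, we can prove the following extension
of Lemma 3.1: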

Lemma 3.2 

Take the network $G = (V, E)$ to be finite, and fix the target vertex $v$ as well as
the initial water profile.


(a) Even if we allow macro moves, the statement of Lemma 3.1 still holds true,
i.e. reducing the range of $\mu_k$ from $[0,\tfrac12]$ to $\{0,\tfrac12\}$ in each round $k$ does not
worsen the outcome of optimal move sequences.

(b) The sharp upper bounds on achievable water levels are not changed if we
allow for pipes to be opened simultaneously. In other words, the supremum
$\kappa(v)$ of water levels achievable at a vertex $v$, as characterized in Definition
1, stays unchanged if we allow move sequences to include macro moves.


Proof: 

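(a) Just as in Lemma 3.1, we consider a move sequence consisting of finitely
many (macro) moves - say again $T \in \mathbb{N}$ - and especially the SAD-profile
dual to the moves after round 1, denoted by $\{\xi_{T-1}(u),\ u \in V\}$. If the first
action is a macro move on the set $A$, let us divide its nodes into two subsets
according to whether their initial water level is above or below the initial
average $\overline{\eta}_0(A)$ across $A$:

$A_a := \{u \in A,\ \eta_0(u) > \overline{\eta}_0(A)\}$ and $A_b := \{u \in A,\ \eta_0(u) \le \overline{\eta}_0(A)\}$.

If $\sum_{u\in A_a} \xi_{T-1}(u)\,\big(\eta_0(u) - \overline{\eta}_0(A)\big) \le \sum_{u\in A_b} \xi_{T-1}(u)\,\big(\overline{\eta}_0(A) - \eta_0(u)\big)$,
changing $\mu_1$ to $\tfrac12$ will not decrease the final water level achieved at $v$. If
instead $\sum_{u\in A_a} \xi_{T-1}(u)\,\big(\eta_0(u) - \overline{\eta}_0(A)\big) > \sum_{u\in A_b} \xi_{T-1}(u)\,\big(\overline{\eta}_0(A) - \eta_0(u)\big)$,
the same holds for erasing the first move (i.e. setting $\mu_1 = 0$).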
(b) Obviously, allowing for pipes to be opened simultaneously can if anything
increase the maximal water level achievable at $v$. However, any such macro
move can be at least approximated by opening pipes one after another.
Levelling out the water profile on a set of more than 2 vertices completely
will correspond to the limit of infinitely many single pipe moves on the edges
between them (in a sensible order).

Let us consider a finite move sequence $\varphi$ including macro moves on the
sets $A_1,\dots,A_l$ (in chronological order). From part (a) we know that with
regard to the final water level achievable at $v$ we can assume w.l.o.g. that
all moves are complete averages (i.e. $\mu_k = \tfrac12$ for all $k$). Fix $\varepsilon > 0$ and let us
define a finite move sequence $\psi$ including no macro moves in the following
way: We keep all the rounds in $\varphi$ in which pipes are opened individually.
For the macro move on $A_i$, $i \in \{1,\dots,l\}$, we insert a finite number of
rounds in which the pipes of an edge set $E_{A_i}$ connecting $A_i$ are opened in
repetitive sweeps such that the water level at each vertex $u \in A_i$ is less than
$\tfrac{\varepsilon}{l}$ away from the average across $A_i$ after these rounds. Note that Lemma
2.2 guarantees that this is possible.

As opening pipes leads to new water levels being convex combinations of the
ones before, the differences of individual water levels caused by replacing the
macro moves add up to $l \cdot \tfrac{\varepsilon}{l} = \varepsilon$ in the worst case. Consequently, the
final water level achieved at $v$ by $\psi$ is at most $\varepsilon$ less than the one achieved
by $\varphi$. Since $\varepsilon > 0$ was arbitrary, this proves the claim.

Note however that the option of macro moves can make a difference when
it comes to the attainability of $\kappa(v)$, see Example 3.6. □


Remark 

Lemma 3.2 (a) states that even for macro moves, there is nothing to be gained
by closing the pipes before the water levels have balanced out completely. A
macro move on the edge set $E_A$ with $\mu_k = \tfrac12$ can be seen as the limit of infinitely
many single edge moves on $E_A$ in the sense of Lemma 2.2 - a connection that
does not exist for macro moves with $\mu_k \in (0,\tfrac12)$. We believe that there always
exists a finite optimal move sequence if macro moves are allowed. We state this
as an open problem.

Due to Lemmas 3.1 and 3.2 we can assume w.l.o.g. that the parameters $\mu_k$
in optimal move sequences are always equal to $\tfrac12$ in each round, hence omit
them and consider the move sequence to be a list of pipes (i.e. $\varphi \in E^T$) only.
We can incorporate an optimal move sequence in which more than one pipe is
opened at a time into Definition 3 by either allowing $\varphi_k$, for $k \in \{1,\dots,T\}$, to
be a subset of $E$ with more than one element on which the levelling takes place
or by viewing $\varphi$ as a limiting case of move sequences $\{\varphi^{(m)},\ m \in \mathbb{N}\}$, in which
pipes are opened separately, that form an infinite type optimal move sequence
$\Phi$ - as just described in the proof of the lemma.

In the sequel however - if not otherwise stated - we will stick to the initial 
regime where pipes are opened one at a time. 
3.2 Optimizing the move sequence

Closely related to the water transport idea is the concept of greedy lattice animals 
as introduced by Cox, Gandolfi, Griffin and Kesten [2]. The vertices of a given 
graph G are associated with an i.i.d. sequence of non-negative random variables 
and a greedy lattice animal of size n is then defined to be a connected subset 
of n vertices containing the target vertex v and maximizing the sum over the 
associated n random variables. Since we do not care about the size of the lattice 
animal, let us slightly change this definition: 

Definition 5 

For a fixed graph $G = (V,E)$, target vertex $v$ and water levels $\{\eta(u)\}_{u\in V}$, let
us call $C \subseteq V$ a lattice animal (LA) for $v$ if $C$ is connected and contains $v$. $C$ is
a greedy lattice animal (GLA) for $v$ if it maximizes the average of water levels
over such sets. This average will be considered as its value

$$\mathrm{GLA}(v) := \frac{1}{|C|}\sum_{u\in C} \eta(u).$$

By Lemma 2.2, it is clear that $\mathrm{GLA}(v) \le \kappa(v)$. In fact, for the majority of
settings - consisting of a graph $G$, a target vertex $v$ and an initial water profile
$\{\eta_0(u)\}_{u\in V}$ - strict inequality holds and we can do better than just pooling the
amount of water collected in an appropriately chosen connected set of barrels
including the one at $v$.
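For small graphs, the value of a greedy lattice animal can be found by exhaustive search over all connected vertex sets containing the target. The sketch below is an illustration under assumptions: the function names, the example graph and its water levels are hypothetical, and the search is exponential in the number of vertices, so it is only meant for toy instances.

```python
from fractions import Fraction
from itertools import combinations


def is_connected(subset, edges):
    """Check connectivity of the subgraph induced by `subset`."""
    subset = set(subset)
    start = next(iter(subset))
    seen, stack = {start}, [start]
    while stack:
        u = stack.pop()
        for x, y in edges:
            for a, b in ((x, y), (y, x)):
                if a == u and b in subset and b not in seen:
                    seen.add(b)
                    stack.append(b)
    return seen == subset


def greedy_lattice_animal(eta, edges, v):
    """Return (value, vertex set) of a GLA for v by exhaustive search."""
    others = [u for u in eta if u != v]
    best_value, best_set = eta[v], (v,)
    for size in range(1, len(others) + 1):
        for extra in combinations(others, size):
            C = (v,) + extra
            if is_connected(C, edges):
                value = sum(eta[u] for u in C) / len(C)
                if value > best_value:
                    best_value, best_set = value, C
    return best_value, set(best_set)


# Hypothetical star-like graph: B is joined to A, C and D.
eta = {"A": Fraction(2, 5), "B": Fraction(1, 5), "C": Fraction(1), "D": Fraction(0)}
edges = [("A", "B"), ("B", "C"), ("B", "D")]
value, animal = greedy_lattice_animal(eta, edges, "A")
print(value, animal)
```

In this toy instance the barrel at B lies below the value of the animal but cannot be left out, since C is only reachable from A through B.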

Furthermore we know from Lemma 3.2 (a) that w.l.o.g. the last move of any 
finite optimal move sequence will be to pool the amount of water allocated in 
a connected set of vertices including v. This greedy lattice animal for v in 
the intermediate water profile created up to that point in time can be more 
advantageous than the one in the initial water profile if we apply the following
improving steps first: 

1) Improving bottlenecks

Let us call a vertex $u$ a bottleneck of the GLA $C$ for $v$ if $u \in C \setminus \{v\}$ and
$\eta(u) < \mathrm{GLA}(v)$. Clearly, each bottleneck $u$ has to be a cutting vertex for
$C$ (otherwise we could just remove it to improve the GLA). If there exists
a connected subset of vertices $C_u$ including $u$ which has a higher average
water level than $C_u \cap C$, the value of the GLA for $v$ is improved if the
water collected in $C_u$ is pooled first (see Figure 2). Note that $C_u$ might
involve more vertices from $C$ than just $u$, see Example 3.5.

2) Enlargement

The second option to raise the value of the GLA $C$ for $v$ is to apply the
idea above to a vertex $u$ in the vertex boundary of $C$ in order for the
original GLA to be enlarged to a set of vertices in which $u$ is a bottleneck.
For this to be beneficial, there has to exist a connected set of vertices $C_u$
in $V \setminus C$ including $u$ with the following property: The average water level
in $C_u$ is smaller than $\mathrm{GLA}(v)$ - otherwise it would be part of $C$ - but is
raised above this value after improving the potential bottleneck $u$ using
water located in $V \setminus C$ (see Figure 2 below).




Figure 2: If A is the target vertex, the GLA on the left is {A, B, C} (having value 0.6)
and the bottleneck B can be improved by first opening the pipe (B, D).
The GLA for A with respect to the water profile on the right is {A}, but
can be enlarged to {A, B, C} if the potential bottleneck B is improved by
opening the pipe (B, D) first.


3) Choose optimal chronological order 

When applying the improving techniques just described, it is essential to 
choose the optimal chronological order of doing things. Besides the fact 
that improving bottlenecks and enlarging the GLA has to be done before 
the final averaging, situations can arise in which different sets of vertices 
can improve the same bottleneck or the other way round that more than 
one bottleneck can be improved using non-disjoint sets of vertices, see the 
set-ups in Figure 3. 




Figure 3: If A is the target vertex, the GLA on the left is {A, B, C} (having value 0.5).
Improving the bottleneck B can be done using D or E and is most effective
if the pipe (B, D) is opened first, then (B, E).
The GLA for A with respect to the graph on the right is {A, B, C, D, E}.
The water from E can be used to improve both bottlenecks B and D. It is
optimal to open pipe (D, F) first and then (B, E).
Finally, it is worth noticing that lattice animals with lower average than $\mathrm{GLA}(v)$
in the initial water profile sometimes can be improved by the techniques just 
described to finally outperform the initial GLA and its possible improvements 
and enlargements (see Example 3.5 and especially Figure 5). 


3.3 Examples 

Example 3.1 

The minimal graph which is non-trivial with respect to water transport is a 
single edge, in other words the complete graph on two vertices: 

$$G = K_2 = (\{1,2\}, \{(1,2)\}).$$

By the considerations in the previous subsection, we get

$$\kappa(1) = \begin{cases} \eta_0(1) & \text{if } \eta_0(1) \ge \eta_0(2)\\ \frac{\eta_0(1)+\eta_0(2)}{2} & \text{if } \eta_0(1) < \eta_0(2). \end{cases} \tag{3}$$

Let the initial water levels be given by the two random variables $U_1$ and $U_2$.
From (3) it immediately follows that

$$U_1 \le \kappa(1) \le \max\{U_1,U_2\}.$$

If we assume $U_1$ and $U_2$ to be independent and uniformly distributed on
$[0,1]$, a short calculation reveals the distribution function

$$F_{\kappa(1)}(x) = \begin{cases} \tfrac32\,x^2 & \text{for } 0 \le x \le \tfrac12\\ x - \tfrac12\,(1-x)^2 & \text{for } \tfrac12 \le x \le 1, \end{cases}$$

which indeed lies in between $F_{U_1}(x) = x$ and $F_{\max\{U_1,U_2\}}(x) = x^2$, see Figure 4.

Figure 4: On the left a visualization of $P(\kappa(1) \le x)$, on the right the distribution
function of $\kappa(1)$.

By symmetry, the exact same considerations hold for $\kappa(2)$.
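The distribution function for the single edge can be cross-checked numerically. The sketch below is an illustration under assumptions: it takes the closed form $\tfrac32 x^2$ on $[0,\tfrac12]$ and $x - \tfrac12(1-x)^2$ on $[\tfrac12,1]$, obtained by integrating (3) for independent unif([0,1]) levels, and compares it with a deterministic midpoint-grid estimate of the probability. The function names and the grid size are choices made for this example.

```python
def kappa_k2(eta1, eta2):
    """Supremum of achievable levels at vertex 1 on a single edge, cf. (3)."""
    return eta1 if eta1 >= eta2 else (eta1 + eta2) / 2


def cdf_kappa_k2(x):
    """Closed-form distribution function for independent unif([0,1]) levels."""
    return 1.5 * x * x if x <= 0.5 else x - 0.5 * (1 - x) ** 2


def grid_estimate(x, n=400):
    """Midpoint-rule estimate of P(kappa(1) <= x) on an n-by-n grid."""
    hits = 0
    for i in range(n):
        for j in range(n):
            u1, u2 = (i + 0.5) / n, (j + 0.5) / n
            if kappa_k2(u1, u2) <= x:
                hits += 1
    return hits / (n * n)


for x in (0.25, 0.5, 0.75):
    print(x, cdf_kappa_k2(x), grid_estimate(x))
```

The grid estimate only approximates the probability, so the two columns agree up to a discretization error that shrinks as the grid is refined.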
Example 3.2

The simplest non-transitive graph (i.e. having vertices of different kind, see
Definition 8) is the line graph on three vertices:

$$G = (\{1,2,3\}, \{(1,2),(2,3)\}).$$

Again by the above considerations, we find the supremum of achievable water
levels at vertex 1 to be

$$\kappa(1) = \max\Big\{\eta_0(1),\ \frac{\eta_0(1)+\eta_0(2)}{2},\ \frac{\eta_0(1)+\eta_0(2)+\eta_0(3)}{3}\Big\},$$

which is obviously achieved by a properly chosen greedy lattice animal.
Consider the case in which the initial water levels satisfy

$$\eta_0(3) \ge \eta_0(2) \ge \eta_0(1) \quad\text{and}\quad \eta_0(3) > \eta_0(1). \tag{4}$$

Then $\kappa(1) = \frac{\eta_0(1)+\eta_0(2)+\eta_0(3)}{3}$, but there exists no finite optimal move sequence.
This can be seen from the fact that any single move will preserve the inequalities
in (4) and thus we have $\eta_T(1) < \kappa(1) < \eta_T(3)$ for all finite move sequences
$\varphi \in E^T$.

If we consider the initial water levels to be independent and identically
distributed, the (random) supremum of achievable water levels at vertex 2 is
stochastically larger than the one at vertex 1: As $\eta_0(1)$ and $\eta_0(2)$ have the same
distribution so do

$$\kappa(1) \quad\text{and}\quad \max\Big\{\eta_0(2),\ \frac{\eta_0(1)+\eta_0(2)}{2},\ \frac{\eta_0(1)+\eta_0(2)+\eta_0(3)}{3}\Big\}.$$

The latter is less than or equal to $\kappa(2)$. The maximal value achievable by greedy
lattice animals at vertex 2 is

$$\mathrm{GLA}(2) = \max\Big\{\eta_0(2),\ \frac{\eta_0(1)+\eta_0(2)}{2},\ \frac{\eta_0(2)+\eta_0(3)}{2},\ \frac{\eta_0(1)+\eta_0(2)+\eta_0(3)}{3}\Big\}.$$

The fact that we can average across one pipe at a time and choose the order of
updates allows us to improve over this and gives

$$\kappa(2) = \max\Big\{\mathrm{GLA}(2),\ \tfrac12\Big(\eta_0(1) + \frac{\eta_0(2)+\eta_0(3)}{2}\Big),\ \tfrac12\Big(\eta_0(3) + \frac{\eta_0(1)+\eta_0(2)}{2}\Big)\Big\}. \tag{5}$$

To see this, we can take a closer look at the SAD-profiles that can be created by updates along the two edges (1,2) and (2,3) starting from the initial profile $\xi_0 = (0,1,0)$: After one update - depending on the chosen edge - the profile is given by $\xi_1 = (\frac12,\frac12,0)$ or $(0,\frac12,\frac12)$. After the second step we end up with either $\xi_2 = (\frac12,\frac14,\frac14)$ or $(\frac14,\frac14,\frac12)$. All of the corresponding convex combinations appear in the right hand side of (5). By Lemma 2.2, we know that continuing like this will finally result in the limiting profile $(\frac13,\frac13,\frac13)$. It is not hard to check that any sequence of two or more updates will lead to a monotone SAD-profile with a largest value of at most $\frac12$ at one and a smallest value of at least $\frac14$ at the other end. For this reason, it can be written as a convex combination of (0,1,0) plus either $(\frac14,\frac14,\frac12)$ and $(0,\frac12,\frac12)$ or $(\frac12,\frac14,\frac14)$ and $(\frac12,\frac12,0)$. Consequently, it cannot correspond to a final water level at vertex 2 exceeding the value in (5).
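As a sanity check of the closed form for the middle vertex, one can compare it with an exhaustive search over short single-edge move sequences. The sketch below is illustrative only: the function names and the sample profile are assumptions, and the search covers only sequences up to a fixed length, so it confirms the formula on this instance rather than proving it.

```python
from fractions import Fraction
from itertools import product

def kappa_2(e1, e2, e3):
    """Closed form for the middle vertex of the line graph 1 - 2 - 3."""
    gla = max(e2, (e1 + e2) / 2, (e2 + e3) / 2, (e1 + e2 + e3) / 3)
    return max(gla, (e1 + (e2 + e3) / 2) / 2, (e3 + (e1 + e2) / 2) / 2)

def best_finite(levels, max_len):
    """Best level at vertex 2 over all single-edge move sequences up to max_len."""
    edges = [(0, 1), (1, 2)]
    best = levels[1]
    for length in range(1, max_len + 1):
        for seq in product(edges, repeat=length):
            cur = list(levels)
            for u, w in seq:
                cur[u] = cur[w] = (cur[u] + cur[w]) / 2
            best = max(best, cur[1])
    return best

# Hypothetical sample profile in which the greedy lattice animal is beaten.
eta0 = [Fraction(4), Fraction(0), Fraction(2)]
```

Here the greedy lattice animals yield at most 2, whereas opening first the pipe (2,3) and then (1,2) gives 5/2 at vertex 2.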

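By an elementary calculation, for independent unif([0,1]) initial water levels, we obtain

$$F_{\kappa(1)}(x) = \mathbb{P}(\kappa(1) \le x) = \int_{s=0}^{x}\int_{t=0}^{\min\{2x-s,1\}} \min\{3x-(s+t),\,1\}\,dt\,ds = \begin{cases} \frac83 x^3 & \text{for } x\in[0,\frac13],\\[2pt] -\frac{11}{6}x^3+\frac92 x^2-\frac32 x+\frac16 & \text{for } x\in[\frac13,\frac12],\\[2pt] -\frac{23}{6}x^3+\frac{13}{2}x^2-2x+\frac16 & \text{for } x\in[\frac12,\frac23],\\[2pt] \frac23 x^3-\frac52 x^2+4x-\frac76 & \text{for } x\in[\frac23,1], \end{cases}$$

and similarly

$$F_{\kappa(2)}(x) = \int_{s=0}^{x}\int_{t=0}^{\min\{2x-s,1\}} \min\Big\{1,\ 2x-s,\ 3x-(s+t),\ 4x-(2s+t),\ 2x-\frac{s+t}{2}\Big\}\,dt\,ds = \begin{cases} 2x^3 & \text{for } x\in[0,\frac12],\\[2pt] -\frac{14}{3}x^3+9x^2-4x+\frac{7}{12} & \text{for } x\in[\frac12,\frac34],\\[2pt] \frac23 x^3-3x^2+5x-\frac53 & \text{for } x\in[\frac34,1], \end{cases}$$

which is strictly smaller than $F_{\kappa(1)}(x)$ for $x\in(0,1)$, implying $\kappa(1) \preceq \kappa(2)$, where $\preceq$ denotes the usual stochastic order. Due to the fact that adding the edge (1,3) will not give an improvement over the optimal move sequences for vertex 2, this stochastic domination already follows from the fact that $\kappa(1)$ is non-decreasing when adding an edge and the symmetry of $K_3$.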

In fact, when optimizing the move sequence for the middle vertex we can neglect the option of levelling out the profile completely, since for any initial water profile there is a finite optimal move sequence $\varphi \in E^T$ achieving

$$\eta_T(2) \ge \tfrac13\big(\eta_0(1)+\eta_0(2)+\eta_0(3)\big),$$

as the next example will show.

Example 3.3 

Given an initial water profile $\{\eta_0(u)\}_{u\in V}$ and the complete graph $K_n$ as underlying network, we get for any $v \in V$:

$$\kappa(v) = 2^{-l+1}\,\eta_0(v) + \sum_{i=1}^{l-1} 2^{-i}\,\eta_0(v_i),$$

where $V$ is ordered such that $\eta_0(v_1) \ge \eta_0(v_2) \ge \dots \ge \eta_0(v_n)$ with $v = v_l$. Furthermore, this optimal value can be achieved by a finite move sequence.

To see this is not hard having Lemmas 2.1 and 2.3 in mind. If $v = v_1$, the highest water level is already in $v$ and the best strategy is to stay away from the pipes. For $v \ne v_1$, the contribution of vertex $v_1$ - i.e. its share in the convex combination of $\{\eta_0(u)\}_{u\in V}$ optimizing $\eta_T(v)$, see (2) - cannot be more than $\frac12$ by Lemma 2.3. However, this can be achieved by opening the pipe $\langle v, v_1\rangle$. According to the duality between water transport and SAD, this is what we do last. The argument just used can be iterated for the remaining share of $\frac12$, giving that $v_2$ can contribute at most $\frac14$ (given that $v_1$ contributes as much as possible) and so on. Obviously, involving vertices holding water levels below $\eta_0(v)$ cannot be beneficial: as all vertices are directly connected, there are no intermediate vertices that could act as bottlenecks.

The optimal move sequence $\varphi \in E^T$, where $T = l-1$, is then given by

$$\varphi_k = \langle v, v_{l-k}\rangle, \quad k = 1,\dots,T,$$

leading to

$$\eta_k(v) = 2^{-k}\,\eta_0(v) + \sum_{i=1}^{k} 2^{-k+i-1}\,\eta_0(v_{l-i})$$

and consequently $\eta_T(v) = \eta_{l-1}(v) = \kappa(v)$. Note that the option to open several pipes simultaneously is useless on the complete graph. Furthermore, the above move sequence only includes edges to which $v$ is incident, so the very same reasoning holds for the center $v$ of a star graph on $n$ vertices as well.

To determine the optimal achievable value at $v$ we have to sort the $n$ initial water levels first. This can be done using the randomized sorting algorithm 'quicksort', which makes $O(n \log n)$ comparisons on average and $O(n^2)$ in the worst case. The calculation of $\kappa(v)$ given the sorted list of initial water levels needs at most $n-1$ additions and $n-1$ divisions by 2.
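The procedure just described can be sketched in a few lines. This is a minimal illustration rather than a reference implementation: the function name `kappa_complete` is an assumption, and Python's built-in `sorted` is used for the sorting step instead of quicksort. The target barrel is averaged with every fuller barrel in increasing order of their levels, so that the fullest one comes last.

```python
from fractions import Fraction

def kappa_complete(levels, v):
    """Optimal level at vertex v (0-based index) on the complete graph.

    Returns the final level together with the number of moves used.
    Only barrels fuller than v's are opened, the fullest one last.
    """
    level = Fraction(levels[v])
    higher = sorted(Fraction(x) for i, x in enumerate(levels)
                    if i != v and x > levels[v])
    moves = 0
    for x in higher:  # corresponds to v_{l-1}, ..., v_1 in the example's notation
        level = (level + x) / 2
        moves += 1
    return level, moves
```

For the levels (1, 3, 2, 0) and the first barrel as target, this averages first with level 2 and then with level 3, giving 9/4 after two moves.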

Example 3.4 

Expanding Example 3.2, let us reconsider the line graph - this time not on three but $n$ vertices. Let the vertices be labelled 1 through $n$ and let vertex 1 (sitting at one end of the line) be the target vertex. Given an initial water profile $\{\eta_0(i)\}_{i=1}^n$, $\kappa(1)$ can be determined by $2n-2$ arithmetic operations ($n-1$ additions, $n-1$ divisions) as it turns out to be

$$\kappa(1) = \max_{1\le l\le n}\ \frac{1}{l}\sum_{i=1}^{l}\eta_0(i). \tag{6}$$

In other words, $\kappa(1)$ is achieved by averaging over the greedy lattice animal for vertex 1 with respect to the initial water profile (see Definition 5).

In order to establish this, let us first define for any water profile $\{\eta_T(i)\}_{i=1}^n$ achievable from $\{\eta_0(i)\}_{i=1}^n$ (in finite time $T \in \mathbb{N}_0$) the corresponding normed vector

$$\lambda_T := \tfrac{1}{M}\,\big(\eta_T(1),\dots,\eta_T(n)\big),$$

which can be understood to be a probability measure on $\{1,\dots,n\}$ - as (1) preserves the total mass $M := \sum_{i=1}^n \eta_0(i)$.

Related to this construction, there are two important observations to make: Firstly, if we consider two water profiles on $\{1,\dots,n\}$ with the same total mass $M$ and their corresponding probability measures being such that one stochastically dominates the other, this relation is preserved when the two profiles are exposed to the same update of the form (1), see also Lemma 2.4 in [4]. Secondly, if $\{\eta_{k+1}(i)\}_{i=1}^n$ arises from $\{\eta_k(i)\}_{i=1}^n$ by an update of the form (1) along an edge $\langle u, u+1\rangle$, where the water level at $u+1$ is higher than the one at $u$, the corresponding probability measure $\lambda_{k+1}$ is stochastically dominated by $\lambda_k$.

If we choose to open only this kind of pipes (where an update brings water closer to vertex 1) in a sensible order - in permanent sweeps from $\langle 1,2\rangle$ to $\langle n-1,n\rangle$ for example - we get a stochastically decreasing sequence of probability measures $(\lambda_k)_{k\in\mathbb{N}_0}$, whose limit is stochastically dominated by any measure $\frac{1}{M}(\eta_T(1),\dots,\eta_T(n))$, where $T \in \mathbb{N}_0$ and $\{\eta_T(i)\}_{i=1}^n$ was created by updates of the form (1) successively applied to the initial profile $\{\eta_0(i)\}_{i=1}^n$.

Furthermore, following this update scheme the water profiles are tending to a piecewise constant profile: Any relation "the barrel at vertex $x$ holds at most as much water as the one at $x+1$" will be preserved if we only open pipes $\langle u,u+1\rangle$, where $u+1$ has a water level higher than the one at $u$. For that reason, already the initial water profile determines two sorts of pipes: If $L$ is minimal in respect of

$$\frac{1}{L}\sum_{i=1}^{L}\eta_0(i) = \max_{1\le l\le n}\ \frac{1}{l}\sum_{i=1}^{l}\eta_0(i)$$

and $L > u$, the water level at $u+1$ will eventually be higher than the one at $u$ causing infinitely many updates along $\langle u,u+1\rangle$ (or the very same water level at $u$ and $u+1$) in the sequel. If instead $L \le u$, either the water level at $u$ is always at least as high as at $u+1$ and $\langle u,u+1\rangle$ will never be opened or the barrel at $u$ is at some point emptier than the one at $u+1$ leading to infinitely many updates along $\langle u,u+1\rangle$ or eventually the same water level at $u$ and $u+1$. This establishes the claimed shape of the limiting profile (according to Lemma 2.2). Its optimality can be seen as follows: The probability measure $\lambda_T$ corresponding to an arbitrary achievable profile $\{\eta_T(i)\}_{i=1}^n$ dominates $\lambda_k$ for $k$ large enough, hence

$$\eta_T(1) = M\cdot\lambda_T(1) \le M\cdot\lambda_k(1) = \eta_k(1) \le \lim_{k\to\infty}\eta_k(1),$$

and (6) follows.

If we allowed macro moves (opening several pipes simultaneously), the first 
(and only) move would be to open the pipes (1, 2),..., (L — 1, L). 
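The value at an end vertex of the line graph is simply the largest prefix average, which can be computed in one pass. The sketch below uses an assumed function name `kappa_line_end` and additionally returns the minimal prefix length attaining the maximum, i.e. the number of barrels that would be pooled by a single macro move.

```python
from fractions import Fraction

def kappa_line_end(levels):
    """Return (kappa(1), L) for the line graph with target vertex 1.

    kappa(1) is the largest average over a prefix of the profile and
    L is the shortest prefix length attaining it.
    """
    best, best_len, total = None, 0, Fraction(0)
    for length, x in enumerate(levels, start=1):
        total += Fraction(x)
        avg = total / length
        if best is None or avg > best:  # strict '>' keeps the minimal L
            best, best_len = avg, length
    return best, best_len
```

For the profile (1, 3, 2, 0) the prefix averages are 1, 2, 2 and 3/2, so the function returns the value 2 together with L = 2.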

Example 3.5 

Finally, let us consider the line graph on n vertices, with the target vertex v not 
sitting at one end. 


[Figure: the line graph on the vertices $1, 2, 3, \dots, n-1, n$ with the target vertex $v$ in its interior.]

Given the initial water levels $\{\eta_0(u),\ 1\le u\le n\}$, let us consider the final SAD-profile $\{\xi(u)\}_{1\le u\le n}$ corresponding to an optimal move sequence (if the move sequence is of infinite type, it is the limit of its dual SAD-profiles we are talking about, cf. Lemmas 2.1 and 3.1).

First of all, from Lemma 2.4 (a) we know that any achievable SAD-profile on line graphs is unimodal (which therefore holds for a pointwise limit of SAD-profiles as well). Let us denote the leftmost maximizer of $\{\xi(u)\}_{1\le u\le n}$ by $q$ and set

$$l := \min\{1\le u\le n,\ \xi(u) > 0\} \quad\text{and}\quad r := \max\{1\le u\le n,\ \xi(u) > 0\}.$$

By symmetry, we can assume without loss of generality $l \le v \le q \le r$ - if $q < v$, the set-up is merely mirrored. Furthermore, let us pick the optimal move sequence such that $\{\xi(u)\}_{1\le u\le n}$ minimizes the distance $q - v$.

The contribution from the nodes $\{q, q+1, \dots, n\}$ can be seen as a scaled-down version of the problem treated in the previous example: This time the drink to be shared does not amount to 1 but to $\sum_{q\le u\le r}\xi(u)$ instead. From Example 3.4 we can therefore conclude that a flat SAD-profile, i.e.

$$\xi(q) = \xi(q+1) = \dots = \xi(r), \tag{7}$$

is optimal. The same holds for the contribution coming from $\{1, 2, \dots, v-1\}$, i.e.

$$\xi(l) = \xi(l+1) = \dots = \xi(v-1). \tag{8}$$

In addition to that, from Lemma 2.4 (c) we know $\xi(r) \le \frac{1}{r-v+1}$.

If $l = v$, part (b) of Lemma 2.4 in turn implies $v = q$. The SAD-profile then features only one non-zero value (namely $\frac{1}{r-v+1}$) and corresponds to the greedy lattice animal for $v$ consisting of the vertices $v, v+1, \dots, r$. If instead $l < v$ - compared to the balanced average across $\{v, v+1, \dots, r\}$ just described - the contribution to the final water level at $v$ (cf. (2)) given by

$$\sum_{u=l}^{v-1}\xi(u)\,\eta_0(u) \quad\text{replaced the contribution}\quad \sum_{u=v}^{r}\Big(\frac{1}{r-v+1}-\xi(u)\Big)\,\eta_0(u), \tag{9}$$

where necessarily

$$M := \sum_{u=l}^{v-1}\xi(u) = \sum_{u=v}^{r}\Big(\frac{1}{r-v+1}-\xi(u)\Big).$$

As $q$ is a mode and due to (7) we have

$$\frac{1}{r-v+1}-\xi(v) \ \ge\ \dots\ \ge\ \frac{1}{r-v+1}-\xi(q) = \dots = \frac{1}{r-v+1}-\xi(r). \tag{10}$$

The aforementioned replacement is most beneficial if the weighted average to the right in (9) is made as small as possible, keeping $M$ fixed. In view of (10) we can conclude, applying again the ideas from the foregoing example - this time think of the initial profile $C - \eta_0(u)$ considered for $v \le u \le r$ only - that this is achieved once more by a balanced average. Hence $l < v$ implies $v < q$ and the just mentioned balanced average has to stretch to the right as far as $q-1$, i.e. $\xi(v) = \dots = \xi(q-1) < \frac{1}{r-v+1} = \xi(q)$, since otherwise $q$ would not be the leftmost mode. From this and (8) we find $M = (v-l)\cdot\xi(l) = (q-v)\cdot\big(\frac{1}{r-v+1}-\xi(v)\big)$.

The assumption that $q - v$ was minimized when picking the optimal move sequence considered, forces

$$\frac{1}{v-l}\sum_{u=l}^{v-1}\eta_0(u) \ >\ \frac{1}{q-v}\sum_{u=v}^{q-1}\eta_0(u),$$

since otherwise the balanced average across $\{v, v+1, \dots, r\}$ would have been at least as good. Connecting $v$ to barrels to the left consequently yields an improvement of the final water level at $v$ (in comparison to $\frac{1}{r-v+1}\sum_{u=v}^{r}\eta_0(u)$) to the amount of

$$\sum_{u=l}^{v-1}\xi(u)\,\eta_0(u) - \sum_{u=v}^{q-1}\Big(\frac{1}{r-v+1}-\xi(u)\Big)\,\eta_0(u) = M\cdot\Big(\frac{1}{v-l}\sum_{u=l}^{v-1}\eta_0(u) - \frac{1}{q-v}\sum_{u=v}^{q-1}\eta_0(u)\Big).$$

As a consequence, $M$ must be as large as possible for an optimal move sequence, which means $\xi(l) = \xi(q-1)$ and makes $\{\xi(u)\}_{1\le u\le n}$ a piecewise constant profile taking on two non-zero values, $\xi(l)$ and $\xi(r)$, as depicted below.



Note that the value $\xi(r) = \frac{1}{r-v+1}$ (and so even $\xi(l)$) is already determined by the choice of $l$, $q$ and $r$. In Figure 5 below, a set of initial water levels on the line graph comprising 15 nodes is shown, for which the SAD-profile corresponding to an optimal move sequence is the one shown above. Furthermore, it can be seen from this instance that the GLA with respect to the initial water profile and its possible enhancements can be outperformed by improving another lattice animal as mentioned at the end of Subsection 3.2.

Figure 5: Even for a graph as simple as the line graph, the initial GLA sometimes has little to do with the optimal move sequence.

When it comes to the complexity of finding $\kappa(v)$, we can exhaustively test all choices for $l, q, r$ - of which there are less than $n^3$. For each choice at most $n+3$ additions/subtractions and four multiplications/divisions have to be made to calculate either

$$\frac{q-v}{(q-l)(r-v+1)}\sum_{u=l}^{q-1}\eta_0(u) + \frac{1}{r-v+1}\sum_{u=q}^{r}\eta_0(u) \quad\text{or}\quad \frac{1}{v-l+1}\sum_{u=l}^{\bar q}\eta_0(u) + \frac{v-\bar q}{(r-\bar q)(v-l+1)}\sum_{u=\bar q+1}^{r}\eta_0(u), \tag{11}$$

depending on whether $v \le q$ or $\bar q \le v$, where $\bar q$ is the rightmost mode of $\{\xi(u)\}_{1\le u\le n}$. Even if there might exist SAD-profiles with $q < v < \bar q$ corresponding to optimal move sequences, by the above we know that there has to be one with either $v \le q$ or $\bar q \le v$ as well. The maximal value among those calculated in (11) equals $\kappa(v)$, so the complexity is $O(n^4)$.
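The exhaustive search over $l$, $q$ and $r$ can be sketched as follows. This is an illustration under stated assumptions, not a verified solver: the function name `kappa_line` is hypothetical, and the candidate values are taken to be those of a two-level SAD-profile, namely weight $1/(r-v+1)$ on the plateau $q,\dots,r$ and weight $(q-v)/((q-l)(r-v+1))$ on $l,\dots,q-1$, together with the mirrored configuration. Prefix sums are used, which brings the cost down to $O(n^3)$ operations.

```python
from fractions import Fraction

def kappa_line(levels, v):
    """Largest candidate value for target vertex v (1-based) on the line graph."""
    n = len(levels)
    prefix = [Fraction(0)]
    for x in levels:
        prefix.append(prefix[-1] + Fraction(x))

    def seg(a, b):  # sum of levels over vertices a..b (1-based, inclusive)
        return prefix[b] - prefix[a - 1]

    best = Fraction(levels[v - 1])
    for l in range(1, v + 1):
        for r in range(v, n + 1):
            # Mode q right of v: low plateau on l..q-1, high plateau on q..r.
            for q in range(v, r + 1):
                high = Fraction(1, r - v + 1)
                if q == v:
                    value = high * seg(v, r)
                else:
                    low = Fraction(q - v, (q - l) * (r - v + 1))
                    value = low * seg(l, q - 1) + high * seg(q, r)
                best = max(best, value)
            # Mirrored case: high plateau on l..q, low plateau on q+1..r.
            for q in range(l, v + 1):
                high = Fraction(1, v - l + 1)
                if q == v:
                    value = high * seg(l, v)
                else:
                    low = Fraction(v - q, (r - q) * (v - l + 1))
                    value = high * seg(l, q) + low * seg(q + 1, r)
                best = max(best, value)
    return best
```

For the hypothetical profile (3, 0, 0, 2) with target vertex 3, the best candidate pools the three left barrels (level 1) and then shares with the rightmost one, giving 3/2, which no greedy lattice animal for that vertex reaches.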

Example 3.6 

The preceding example can serve to give a concrete instance in which even 
an infinite sequence of single edge moves can not achieve the supremum as 
mentioned after Definition 3. 

Consider the line graph on four vertices, the target vertex not to be one of the end vertices and initial water levels as depicted to the right.

From Example 3.5 we know that the optimal SAD-profile will allocate $\frac16$ of the shared glass of water to each of the vertices to the left of $v$ and $v$ itself, the maximal amount of $\frac12$ to the rightmost vertex $q$ - showing that $\kappa(v) = 0.6$: First, recall that any SAD-profile on a line graph is unimodal. If $q$ is not the (only) mode, the contribution of $v$ and $q$ has an average of at most 0.5 and thus the SAD-profile in question yields a water level at $v$ of at most 0.5 - see (2). If $q$ is the mode, the SAD-profile is non-decreasing from left to right and thus a flat profile on the vertices other than $q$ uniquely optimal. Finally, to achieve the optimum, the contribution of $q$ has to be maximal, i.e. $\frac12$ (see Lemma 2.3).

From the considerations in Thm. 2.3 in [4] it is clear that this SAD-profile, more precisely the value $\frac12$ at $q$, can only be established if the first move is $v$ sharing the drink with $q$ (which corresponds to the last move in the water transport - see Lemma 2.1). Once $v$ starts to share the drink to the left, any other interaction with $q$ will decrease the contribution of the latter and thus put a water level of 0.6 at $v$ out of reach.

To get a flat profile on three vertices, we need however infinitely many single-edge moves (here on $e_1$ and $e_2$). An infinite type optimal move sequence is for example given by

$$\Phi = \{\varphi^{(m)},\ m\in\mathbb{N}\}, \quad\text{where}\quad \varphi^{(m)} = (e_1, e_2, e_1, e_2, \dots, \langle v,q\rangle) \in E^{T_m},\quad T_m = 2m+1,$$

achieving

$$\lim_{m\to\infty}\eta^{(m)}_{T_m}(v) = 0.6 = \kappa(v),$$

a value that cannot be approached by any stand-alone (finite or) infinite sequence of moves.

If we allow macro moves, however, there is a two-step move sequence achieving the water level 0.6 at $v$: First we open the pipes $e_1$ and $e_2$ simultaneously to pool the water of the vertices other than $q$ and in the second round, we open the pipe $\langle v,q\rangle$.
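The difference between the sequences $\varphi^{(m)}$ and the macro-move strategy can be made concrete in a small simulation. The sketch below is illustrative only: the initial levels are hypothetical (the original figure with the actual levels is not reproduced here) and were chosen so that the supremum at $v$ equals 0.6; the function name `level_at_v` is likewise an assumption.

```python
from fractions import Fraction

def level_at_v(levels, m):
    """Level at v after m alternations of e1, e2 followed by the move <v, q>."""
    a, b, c, d = levels      # two barrels left of v, then v, then q
    for _ in range(m):
        a = b = (a + b) / 2  # e1: pipe between the two leftmost barrels
        b = c = (b + c) / 2  # e2: pipe between the second barrel and v
    return (c + d) / 2       # final move: pipe <v, q>

# Hypothetical initial levels (not taken from the original figure).
eta0 = (Fraction(3, 5), Fraction(0), Fraction(0), Fraction(1))
values = [level_at_v(eta0, m) for m in range(1, 11)]

# Macro moves: pool the three left barrels completely, then open <v, q>.
macro = (sum(eta0[:3]) / 3 + eta0[3]) / 2
```

With these levels, each additional alternation brings the final level at $v$ closer to 0.6 without reaching it, while the two macro moves attain it exactly.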

4 Complexity of the problem 

In this section, we want to build on the complexity considerations for the water transport on finite graphs from the examples in Section 3. In fact, we want to show that the task of determining whether $\kappa(v)$ is larger or smaller than a given constant - for a generic set-up, consisting of a graph, target vertex and initial water profile - is an NP-hard problem. This is done by establishing the following theorem:

Theorem 4.1

The NP-complete problem 3-SAT can be polynomially reduced to the decision problem of whether $\kappa(v) > c$ or not, for an appropriately chosen water transport instance and constant $c$.

Before we deal with the design of an appropriate water transport instance in 
order to embed the satisfiability problem 3-SAT, let us provide the definition of 
Boolean satisfiability problems as well as known facts about their complexity. 

Definition 6 

Let $X = \{x_1, x_2, \dots, x_k\}$ denote a set of Boolean variables, i.e. taking on logic truth values 'TRUE' (T) and 'FALSE' (F). If $x$ is a variable in $X$, $x$ and $\bar x$ are called literals over $X$. A truth assignment for $X$ is a function $t : X \to \{T, F\}$, where $t(x) = T$ means that the variable $x$ is set to 'TRUE' and $t(x) = F$ means that $x$ is set to 'FALSE'. The literal $x$ is true under $t$ if and only if $t(x) = T$, $\bar x$ is true under $t$ if and only if $t(x) = F$.

A clause $C$ over $X$ is a disjunction of literals and satisfied by $t$ if at least one of its literals is true under $t$. A logic formula $F$ is in conjunctive normal form (CNF) if it is the conjunction of (finitely many) clauses. It is called satisfiable if there exists a truth assignment $t$ such that all its clauses are satisfied under $t$.

The standard Boolean satisfiability problem (often denoted by SAT) is to decide whether a given formula in CNF is satisfiable or not. If we restrict to the case where all the clauses in the formula consist of at most 3 literals it is called 3-SAT.

3-SAT was among the first computational problems shown to be NP-complete, a result published in a pioneering article by Cook in 1971, see Thm. 2 in [1].
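To make the definition concrete, a CNF formula can be encoded as a list of clauses and tested for satisfiability by brute force. The encoding below (non-zero integers for literals, with `i` standing for $x_i$ and `-i` for $\bar x_i$) and the function name `satisfiable` are illustrative conventions, not part of the text; the exponential running time in $k$ is of course exactly what one expects for an NP-complete problem.

```python
from itertools import product

def satisfiable(clauses, k):
    """Brute-force satisfiability check for a CNF formula over k variables.

    Each clause is a tuple of non-zero integers: i stands for x_i,
    -i for its negation. Returns a satisfying assignment as a tuple
    of booleans, or None if the formula is not satisfiable.
    """
    for assignment in product([False, True], repeat=k):
        def true_under(lit):
            value = assignment[abs(lit) - 1]
            return value if lit > 0 else not value
        if all(any(true_under(lit) for lit in clause) for clause in clauses):
            return assignment
    return None
```

For example, the formula $(x_1 \vee x_2) \wedge (\bar x_1 \vee x_2) \wedge (\bar x_2 \vee x_3)$ is encoded as `[(1, 2), (-1, 2), (-2, 3)]` with `k = 3`.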

Let us now turn to the task of embedding 3-SAT into an appropriately designed water transport problem that is in size polynomial in $n$, the number of clauses of the given 3-SAT problem:

Given the logic formula $F = C_1 \wedge C_2 \wedge \dots \wedge C_n$ in which each of the clauses $C_i$ consists of at most 3 distinct literals, let us define the comb-like graph depicted in Figure 6. All the white nodes, plus the target vertex $v$, represent empty barrels. The other ones that are shaded in blue contain water to the amount specified.



Figure 6: A polynomial reduction of 3-SAT to the water transport problem.

The comb has $k$ teeth, where $k$ is the number of variables appearing in $F$. Each individual tooth is formed by a line graph on $240n^4 - 1$ vertices with water level 1 each. The lower endvertex of the $i$th tooth is connected to two vertices representing the literals $x_i$ and $\bar x_i$, having water level 2 respectively. In between the teeth there are $k-1$ link nodes, each of which features itself a water level of 2 and is connected to the four nodes representing literals of consecutive variables - more precisely, the link node in between tooth $i$ and $i+1$ is connected to the vertices $x_i, \bar x_i, x_{i+1}, \bar x_{i+1}$, for $i \in \{1, \dots, k-1\}$. The vertices representing $x_k, \bar x_k$ are connected to the rightmost link node as well as to an additional vertex featuring a water reservoir of level $\frac72$. Left of the first tooth, there is another link node (with water level 2 as well) connected to $x_1$ and $\bar x_1$ as well as by a path to the shaft of the comb, which is described next.

The comb's shaft is made up of a line graph on $2n+2$ vertices, with the target vertex $v$ to the very right. To the left of $v$ there is a vertex representing a barrel with water level 3 followed by $n$ (empty) barrels that stand for the clauses $C_1, \dots, C_n$ and are separated by a vertex with water level 3 respectively. The left endvertex (connected to $C_1$) features a water level of 3 as well and is connected to the leftmost link node as mentioned before, namely via a path consisting of $4n-1$ nodes with water level 2 each.

Finally, the teeth are connected to the shaft through (disjoint) connecting paths from nodes representing literals to nodes representing clauses, where for example $x_2$ is linked to $C_2$ by a path if it appears in this clause. Each of these paths is formed by a line graph on $120n^2 - 1$ vertices representing empty barrels. Note that each clause-node is linked to at most 3 connecting paths, whereas the number of connecting paths originating from a vertex representing a literal can vary between 0 and $n$.

In connection with the water transport problem originating from a 3-SAT 
formula F as depicted in Figure 6, we claim the following: 

Proposition 4.2 

Consider the water transport problem based on the logical formula $F$, given by the graph, target vertex and initial water profile as depicted in Figure 6.

(a) If $F$ is satisfiable, then the water level at $v$ can be raised to a value strictly larger than 2, i.e. $\kappa(v) > 2$.

(b) If $F$ is not satisfiable, then this is impossible, i.e. $\kappa(v) \le 2$.

Before we deal with the proof of the proposition, note how it implies the statement of Theorem 4.1: First of all, if $F$ is a 3-SAT formula consisting of $n$ clauses, $k$ cannot exceed $3n$. Given this, it is not hard to check that the graph in Figure 6 has no more than $720n^5 + 360n^3 + 9n + 2$ vertices and maximal degree at most $n+3$ (or 5 if $n = 1$). As the initial water levels are all in $\{0, 1, 2, 3, \frac72\}$, the size of this water transport instance is clearly polynomial in $n$. Due to the fact that the value of $\kappa(v)$ can be used to decide whether the given formula $F$ is satisfiable or not - as claimed by Proposition 4.2 - Theorem 4.1 follows.
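The vertex count behind the polynomial size bound can be tallied component by component. The sketch below assumes teeth of $240n^4 - 1$ vertices, connecting paths of $120n^2 - 1$ vertices and one connecting path per literal occurrence; the function names and the parameter `paths` are illustrative.

```python
def comb_vertex_count(n, k, paths):
    """Number of vertices of the comb-like graph.

    n: number of clauses, k: number of variables,
    paths: number of connecting paths (one per literal occurrence).
    """
    teeth = k * (240 * n**4 - 1)
    literals_and_links = 2 * k + k  # 2k literal nodes and k link nodes
    reservoir = 1
    shaft = 2 * n + 2               # v, n clause-nodes, n + 1 nodes of level 3
    left_path = 4 * n - 1           # path from the shaft to the leftmost link node
    connecting = paths * (120 * n**2 - 1)
    return (teeth + literals_and_links + reservoir
            + shaft + left_path + connecting)

def vertex_bound(n):
    """Upper bound on the number of vertices for a formula with n clauses."""
    return 720 * n**5 + 360 * n**3 + 9 * n + 2
```

Under these assumptions the bound is attained when $k = 3n$ and every clause contains three literals, i.e. `paths = 3 * n`.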

Proof of Proposition 4.2 (a): To prove the first part of the proposition, let us assume that $F$ is satisfiable. Then there exists a truth assignment $t$ with the property that all clauses $C_1, \dots, C_n$ contain at least one of the $k$ literals that are set true by $t$. Those can be used to let the water trickle down from the teeth to the line graph at the bottom in an effective way: We assign each clause to one of the true literals under $t$ which it contains. Then, we average the water over $k$ (disjoint) star-shaped trees. Each such tree has a literal $x \in \{x_1, \bar x_1, \dots, x_k, \bar x_k\}$ that is true under $t$ as its center and the top node of the tooth above $x$ as well as the nodes representing the clauses assigned to $x$ as leaves (where the clause-nodes are connected to $x$ in the tree via the corresponding connecting paths). If $m$ clauses chose $x$, there are $240n^4 + m\cdot 120n^2$ vertices in the tree and the water accumulated amounts to $240n^4 + 1$.

By pooling the water along those trees, all the nodes corresponding to clauses can simultaneously be pushed to a water level as close to the average of the corresponding trees as we like (see Lemma 2.2). As $m \le n$, we can bound these averages from below by

$$\frac{240n^4+1}{240n^4+120mn^2} \ \ge\ \frac{240n^4+1}{240n^4+120n^3} \ >\ 1-\frac{1}{2n}.$$

So after this procedure, each clause-node will have a water level strictly larger than $1 - \frac{1}{2n}$. Note that only one of each pair $\{x_i, \bar x_i\}$ was used as a water passage, so there is still a line graph - let us call it linking path - consisting of vertices with water level 2 exclusively, from the leftmost link node to the vertex with initial water level $\frac72$ through all link nodes and the untouched literals (the ones that are false under $t$).

Another complete averaging - this time over the line graph that consists of the shaft (i.e. the line graph at the bottom in Figure 6), the path to the very left connecting the leftmost link node to the shaft, the linking path just described, as well as the reservoir with level $\frac72$ at the other end - will push the water level at $v$ beyond

$$\frac{1}{6n+2k+2}\Big(\tfrac72 + (2k+4n-1)\cdot 2 + (n+1)\cdot 3 + n\big(1-\tfrac{1}{2n}\big)\Big) = \frac{12n+4k+4}{6n+2k+2} = 2.$$

Consequently, for the case of satisfiable $F$ we verified for the graph depicted in Figure 6: $\kappa(v) > 2$. □
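The two estimates used in this part of the proof are plain arithmetic and can be checked with exact fractions. The sketch below assumes a reservoir level of $\frac72$, teeth of $240n^4-1$ vertices and connecting paths of $120n^2-1$ vertices; the function names are illustrative.

```python
from fractions import Fraction

def tree_average(n, m):
    """Average water level over a star-shaped tree serving m clauses."""
    return Fraction(240 * n**4 + 1, 240 * n**4 + 120 * m * n**2)

def final_average(n, k, clause_level):
    """Average over shaft, left path, linking path and reservoir.

    The set consists of 6n + 2k + 2 barrels: the reservoir (7/2),
    2k + 4n - 1 barrels of level 2, n + 1 barrels of level 3 and
    n clause-nodes at the given level.
    """
    water = (Fraction(7, 2) + (2 * k + 4 * n - 1) * 2
             + (n + 1) * 3 + n * clause_level)
    return water / (6 * n + 2 * k + 2)
```

With clause-nodes at exactly $1 - \frac{1}{2n}$ the final average equals 2, so any strictly higher clause level pushes the level at $v$ strictly above 2.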

In the proof of the second part of the proposition, we need a rough estimate 
of how much the water level in a vertex representing a clause can be raised, if 
only accessed via connecting paths. This is done in the following lemma. 

Lemma 4.3 

In the comb-like graph depicted in Figure 6, it is impossible to push the water level in a clause-vertex above the value of $1 + \frac{1}{2n}$ without opening the pipes to its left or right neighbor.

Proof: The proof of this claim is a simple comparison with a tree similar to the structure above the node corresponding to some clause $C_i$. Originating from $C_i$, there are at most 3 connecting paths that lead to three nodes representing literals. Initially, the node corresponding to $C_i$ and the ones on the connecting paths are empty. Their water level can be raised to almost 1 using water from the teeth of the comb and further using nodes with initial water level higher than 1. The fact that opening pipes always produces convex combinations of the involved water levels (see (2)) guarantees that the total amount of water above a fixed level - cumulated over all barrels - is non-increasing when pipes are opened. Initially, the cumulated amount of water above level 1 in the whole graph is

$$\big(\tfrac72 - 1\big) + 3k\cdot(2-1) + (4n-1)\cdot(2-1) + (n+1)\cdot(3-1) \ \le\ 15n + \tfrac72. \tag{12}$$

For $n \in \mathbb{N}$, this is clearly less than $20n - 1$.

We can mimic any attempt raising the water level at $C_i$ in the comb-graph via its connecting paths in the tree depicted to the right in Figure 7 in such a way, that the water levels at $C_i$ and on its connecting paths are at any point in time at most as high as the ones in the corresponding part of the comparison tree: If water is routed into the connecting paths above $C_i$ but water levels do not exceed 1 (e.g. when routing water down from the teeth) we do nothing in the comparison tree. If water from the vertices with initial water level above 1 is introduced into the connecting paths, we introduce the same amount to the corresponding connecting paths in the tree (note that this is possible, as the total amount of water above level 1 in the comb-graph is available in all three leaves of the tree). Every move involving only nodes from the connecting paths depicted and $C_i$ is copied in the tree. This retains the property that the water levels in the tree are not less than the ones in corresponding nodes of the comb-graph and shows that the highest water level achievable at $C_i$ in the tree is an upper bound on the level achievable in the comb-graph. If there are less than 3 connecting paths above $C_i$ in the comb-graph we can either modify the comparison tree accordingly or just not use the extra branches.

Figure 7: Comparison of the structure above the node representing a clause $C_i$ in the comb-graph with an appropriately tailored tree.

By the generalization of Thm. 2.3 in [4] to trees, see the comment after Lemma 2.4, we know that the contribution to the convex combination at $C_i$ from the leaves in the tree is at most 1 divided by the graph distance plus one, i.e. $\frac{1}{120n^2+1}$. The water level at $C_i$ in the tree can therefore not exceed

$$1 + 3\cdot\frac{20n-1}{120n^2+1} \ \le\ 1 + \frac{1}{2n},$$

which induces the claim. □

Note that the same argument with only water to the amount of $5n - 1$ above level 1 available in the leaves would give the upper bound of $1 + 3\cdot\frac{5n-1}{120n^2+1} \le 1 + \frac{1}{8n}$, which will be used in the proof of Proposition 4.2 (b) as well.
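The numerical estimates in Lemma 4.3 and the subsequent remark can be double-checked with exact arithmetic. The sketch below assumes a reservoir level of $\frac72$ and connecting paths of $120n^2 - 1$ vertices (so that leaves are at graph distance $120n^2$ from the clause-node); the function names are illustrative.

```python
from fractions import Fraction

def water_above_level_one(n, k):
    """Cumulated water above level 1 in the whole comb-graph."""
    return ((Fraction(7, 2) - 1) + 3 * k * (2 - 1)
            + (4 * n - 1) * (2 - 1) + (n + 1) * (3 - 1))

def clause_level_bound(n, extra):
    """Bound 1 + 3 * extra / (120 n^2 + 1) for a clause-node fed via three paths,
    where `extra` is the water above level 1 available in each leaf."""
    return 1 + Fraction(3 * extra, 120 * n**2 + 1)
```

For $k = 3n$ the first function returns $15n + \frac72$, which stays below $20n - 1$ for every $n \ge 1$, and the two calls `clause_level_bound(n, 20 * n - 1)` and `clause_level_bound(n, 5 * n - 1)` stay below $1 + \frac{1}{2n}$ and $1 + \frac{1}{8n}$ respectively.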

Proof of Proposition 4.2 (b): To check that in case $F$ is not satisfiable we get $\kappa(v) \le 2$ is a bit more involved than the first part: Let us assume the contrary. Then there exists a finite move sequence (involving macro moves say) that achieves a final water level $\eta_T(v) > 2$. By the idea in part (a) of Lemma 3.2 we can assume the last move to be the complete average over a connected vertex set $A$ including $v$. The only barrels with initial water level larger than 2 are the ones left of each clause-node and of $v$ plus the reservoir. Including any node apart from these into the set $A$, when trying to achieve $\eta_T(v) > 2$, can therefore only be beneficial if it is a bottleneck (see the discussion after Definition 5).

Structurally speaking, there are three potential candidates for such a set A: 

• a set containing some vertex from a connecting path 

• a set containing only vertices from the bottom line graph or 

• a set containing the reservoir vertex but no connecting path. 

Note that the set we used in the case of satisfiable F was of the third type. We 
will see in a moment that this is in fact the only relevant candidate for the set A 
in the sense that the other two do not allow to raise the water level at v above 
the value of 2, even for a satisfiable formula F. 

The first candidate listed is ruled out rather easily: If $A$ contains a vertex from a connecting path, the bottleneck argument forces $A$ to contain the whole corresponding connecting path (recall: a bottleneck has to be a cut vertex between barrels with water levels above average and the target vertex). Then $A$ is of size at least $120n^2$ and the amount of water above the water level of 1 is just not sufficient to fill up so many vertices to a level of two: From (12) we know that the water available above level 1 in the whole graph is at most $15n + \frac72$ initially and non-increasing. The amount in a whole connecting path with water level 2 would be $120n^2 - 1$, so this can definitely not be achieved.

Next, let us assume that $A$ is a subset of the vertices of the bottom line graph - including vertex $v$ and $m$ clauses. Again by the bottleneck argument, we can assume that the leftmost node in $A$ is not a clause-node, i.e. has initial water level 3 (see Figure 8).

Figure 8: Vertices of the set $A$ considered in the second case.

With the intention to increase the total amount of water inside $A$ before the final averaging, one can try to fill up the clause-nodes. However, from Lemma 4.3 we know that the water level at the clause-nodes (being bottlenecks) in $A$ cannot be pushed much above the level of 1, if accessed via connecting paths only. Further, this makes accessing vertex $v$ through a connecting path and $C_n$ unfavorable. Note that opening the pipes in the bottom line graph in order to connect barrels representing clauses inside $A$ to the ones with water level 2 or 3 outside $A$ might increase the amount of water in $A$ as well, but will raise the water level at the involved clause-nodes to a level that cannot be further improved by using links via connecting paths, so it is most beneficial to fill up the clause-nodes with water routed through connecting paths first.

Let us assume that after this first phase, we managed to achieve a water level of $1 + \frac{1}{2n}$ at $C_{n-m+1}, \dots, C_n$. This might be technically impossible, but surely dominates the water levels achievable using the connecting paths only and simplifies our further considerations. Staying away from the connecting paths, water can only be routed into $A$ via $C_{n-m}$. If we average over all nodes in $A$ once while doing so, the final averaging is meaningless (because then the effect of any move between this and the last move will be increased if again all pipes inside $A$ are opened). However, since the last move has to involve $v$ we can assume that any move before leaves the pipe on the edge incident to $v$ closed - and thus w.l.o.g. the pipe from $C_n$ towards $v$ as well.

This in turn requires that the connected subset of nodes outside $A$ (incident to the leftmost node in $A$) that pools its water with a connected subset inside $A$, including $C_{n-m+1}$ but not $v$, has an average of at least $2 + \frac{1}{4n}$, as the amount of water inside $A$ would decrease otherwise. In view of Lemma 4.3, the only useful move is therefore to open the pipes along the shaft and through the nodes with initial water level 2 (which they actually might have lost during the first phase) in order to connect the vertex with water reservoir $\frac72$ to $A$. No matter which of the nodes representing literals we include, the water levels in the path connecting the leftmost link node to the shaft and the shaft itself will be dominated by the ones obtained if we pretend that the water above level 2 from the reservoir can be transferred to the leftmost link node without any losses.

Starting with a water level of $\frac72$ in the leftmost link node instead, we might increase the amount of water inside $A$ by at most another $\frac32\cdot\frac{2m}{2m+4n} \le \frac12$ (as the path has to involve at least $4n$ nodes outside $A$, the subset inside $A$ is of size at most $2m \le 2n$ and already has an average of at least 2). Along a line graph, the contribution of the water level from an endvertex to the ones formed as convex combinations along the line is decreasing with the graph distance (see Lemma 2.4 (b)).

Despite our greatest efforts, the total amount of water in $A$ will consequently not exceed the value $3(m+1) + m\big(1+\frac{1}{2n}\big) + \frac12 \le 4(m+1)$. Since $A$ consists of $2m+2$ vertices, leveling out across this set will possibly raise the amount of water in the barrel at $v$ to the level of 2, but not beyond.

Finally, consider A to contain all of the shaft as well as the vertex with water 
reservoir but no vertex from a connecting path. Then A, being connected. 
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has to contain the path that consists of 4 n — 1 vertices, connecting the shaft to 
the leftmost link node, as well as a linking path through link nodes and vertices 
representing literals as described above. However, this time - with F being not 
satisfiable - it is impossible to fill up all the clause-nodes to a level of about 
1, leaving at least one path between the reservoir and the leftmost link node 
unaffected: In order to reach all clause-nodes, we have to use both $x_i$ and $\overline{x}_i$ as
water passage for at least one $i \in \{1, \dots, k\}$.

In comparison to the case of satisfiable F, we will lose an amount of at least 
1 — ^ for each clause-node that is not reached before the final averaging - but 
likewise an amount of at least 1 in the linking path for each pair $\{x_i, \overline{x}_i\}$ in which
both nodes were used as water passage - since their water level of 2 reduces to 
something less than 1 when water from the tooth above is routed through the 
node all the way down to a clause-node. By the same token as in Lemma 4.3, the 
clause-nodes can be filled up to a level of at most 1 -I- ^ through the connecting 
paths, as the water available outside A above level 1 is $k$ (from the literals not
part of the linking path) plus 1 from a vertex representing a literal on the linking
path if we need to route through such (and $k + 1 < 5n - 1$). Note that moving
water from inside A through a connecting path to a clause will in fact reduce 
the amount of water in A. Consequently, the set A (which is the same as the
one chosen for satisfiable F) still contains $6n + 2k + 2$ vertices, but the amount
of water we can allocate in A is at most 

|-|-(2fc-|-4n — l)-2-|-(n-|-l)-3-|-n(l-|-^) — (1— 

= 12n-|-4fc-|-^-|-^ 

$< 12n + 4k + 4$,

for n > 2. Thus, even in this manner we can not raise the water level at vertex
$v$ to a level of 2 or above if F is not satisfiable, which contradicts the above
assumption and in consequence verifies $\kappa(v) < 2$ for this case. □

As already mentioned, this shows that solving the decision problem “$\kappa(v) \geq 2$
or $\kappa(v) < 2$” for the comb-like graph depicted in Figure 6 solves the corresponding
3-SAT problem as well. Since 3-SAT is an NP-complete problem, we hereby
established that any problem in NP can be polynomially reduced to a decision
problem minor to the computation of $\kappa(v)$ in a suitable water transport instance
- showing that computing $\kappa(v)$ in general is indeed an NP-hard problem.


5 On infinite graphs 

This last section is devoted to the water transport problem on infinite graphs. 
We consider an infinite, connected, simple graph $G = (V, E)$ with bounded
maximal degree. The initial water levels $\{\eta_0(v)\}_{v \in V}$ are considered to be i.i.d.
with a (non-degenerate) common marginal distribution concentrated on $[0, C]$,
for some $C > 0$. The supremum $\kappa(v)$ of achievable water levels at a fixed target
vertex $v \in V$ depends on the initial water levels of course, which makes it a
random variable as well.

When the vertices of an infinite graph are assigned individual values, the
most natural definition of a global average across the graph is to look at a 
fixed sequence of nested subsets of the vertex set, with the property that every 
vertex is included eventually, and then consider the limit of averages across 
those subsets (if it exists). 

Given i.i.d. initial water levels, the strong law of large numbers tells us that 
the randomness of the global average - which is non-degenerate on finite graphs 
- becomes degenerate if we consider infinite graphs, where it will a.s. equal 
the expectation of the marginal distribution. The quantity $\kappa(v)$, however, shows a slightly
different behavior: In order to determine whether the supremum of achievable
water levels at a given vertex v is a.s. constant or not, we have to investigate 
the global structure of the infinite graph a bit more closely. 

If the graph contains a half-line with sufficiently many extra vertices attached
to it, the distribution of $\kappa(v)$ becomes degenerate for all $v \in V$ - as stated in
Theorem 5.1 and the final remark: One can in fact, with probability 1, push the
water level at $v$ to the essential supremum of the marginal distribution. The
infinite line graph, however, is too lean to feature such a substructure and therefore
behaves much more like a finite graph, in the sense that the distribution of $\kappa(v)$
is non-degenerate - see Theorem 5.3. In order to develop these two main results
of this section, let us first properly define what we mean by “sufficiently many
extra vertices”.

Definition 7 

Let $G = (V, E)$ be an infinite connected simple graph. It is said to contain a
neighbor-rich half-line, if there exists a subgraph of $G$ consisting of a half-line

$$H = \big(\{v_k,\ k \in \mathbb{N}\},\ \{(v_k, v_{k+1}),\ k \in \mathbb{N}\}\big)$$

and distinct vertices $\{u_k,\ k \in \mathbb{N}\}$ from $V \setminus \{v_k,\ k \in \mathbb{N}\}$ such that there is an
injective function $f : \mathbb{N} \to \mathbb{N}$ with the following two properties (cf. Figure 9):

(i) For all $k \in \mathbb{N}$: $(u_k, v_{f(k)}) \in E$, i.e. the vertices $u_k$ and $v_{f(k)}$ are neighbors
in $G$.

(ii) The function $f$ is growing slowly enough in the sense that $\sum_{k=1}^{\infty} \frac{1}{f(k)}$ diverges.


Figure 9: The beginning part of a neighbor-rich half-line.
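Condition (ii) can be explored numerically. The following minimal sketch compares partial sums of $1/f(k)$ for two hypothetical attachment patterns; the names `partial_sum`, `every_third` and `squares` are introduced here purely for illustration and are not part of the original text. Finite partial sums can of course only suggest, not prove, divergence or convergence.

```python
def partial_sum(f, n):
    """Return sum_{k=1}^{n} 1/f(k) for an index function f."""
    return sum(1.0 / f(k) for k in range(1, n + 1))

def every_third(k):
    # Hypothetical pattern: an extra neighbor at every third vertex of H.
    return 3 * k

def squares(k):
    # Hypothetical pattern: extra neighbors only at square indices.
    return k * k

if __name__ == "__main__":
    for n in (10, 1_000, 100_000):
        print(n,
              round(partial_sum(every_third, n), 3),
              round(partial_sum(squares, n), 3))
```

The partial sums for `every_third` keep growing like a third of the harmonic series, whereas those for `squares` stay below $\pi^2/6$, so only the first pattern is compatible with condition (ii).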


Note that - by a renumbering of $\{u_k,\ k \in \mathbb{N}\}$ - we can always assume
the function $f$ to be (strictly) increasing. Furthermore, if $G$ is connected and
contains a neighbor-rich half-line, we can choose any vertex $v \in V$ to be its
beginning vertex: If $v_l$ is the vertex with highest index at shortest distance to
$v$ in $H$, replace $(v_1, \dots, v_l)$ by a shortest path from $v$ to $v_l$ in $H$. The altered
half-line will still be neighbor-rich, since for all $M, N \in \mathbb{N}$ and $f$ as above:

$$\sum_{k=1}^{\infty} \frac{1}{f(k)} = \infty \iff \sum_{k=M}^{\infty} \frac{1}{f(k) + N} = \infty.$$
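The relation between the two series can be checked by a direct comparison of the summands. The following short sketch of that step uses only that $f$ takes values in $\mathbb{N}$ (so $f(k) \geq 1$); it is added for convenience and is not taken from the original text:

```latex
% For all k: f(k) >= 1, hence f(k) + N <= (N + 1) f(k), which gives
\[
  \frac{1}{(N+1)\, f(k)} \;\le\; \frac{1}{f(k) + N} \;\le\; \frac{1}{f(k)} .
\]
% Summing over k >= M yields
\[
  \frac{1}{N+1} \sum_{k=M}^{\infty} \frac{1}{f(k)}
  \;\le\; \sum_{k=M}^{\infty} \frac{1}{f(k) + N}
  \;\le\; \sum_{k=1}^{\infty} \frac{1}{f(k)} .
\]
```

Since dropping the finitely many terms with $k < M$ does not affect divergence, either of the two series diverges if and only if the other one does.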


With this notion in hand, we can state and prove the following result: 

Theorem 5.1 

Consider an infinite (connected) graph $G = (V, E)$ and the initial water levels
to be i.i.d. unif([0,1]). Let $v \in V$ be a fixed vertex of the graph. If $G$ contains a
neighbor-rich half-line, then $\kappa(v) = 1$ almost surely.


Before embarking on the proof of this theorem, we are going to show a 
standard auxiliary result which will be needed in the proof: 

Lemma 5.2 

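For $\varepsilon > 0$, let $(Y_k)_{k \in \mathbb{N}}$ be an i.i.d. sequence having Bernoulli distribution with
parameter $\varepsilon$. If the function $f : \mathbb{N} \to \mathbb{N}$ is strictly increasing and such that
$\sum_{k=1}^{\infty} \frac{1}{f(k)}$ diverges, then

$$\sum_{k=1}^{\infty} \frac{Y_k}{f(k)} = \infty \quad \text{almost surely.}$$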


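Proof: Let us define

$$X_n := \sum_{k=1}^{n} \frac{Y_k - \varepsilon}{f(k)} \quad \text{for all } n \in \mathbb{N}.$$

As the increments are independent and centered, this defines a martingale with
respect to the natural filtration. Furthermore,

$$\mathbb{E}(X_n^2) = \sum_{k=1}^{n} \frac{\mathbb{E}(Y_k - \varepsilon)^2}{f(k)^2} \leq \sum_{k=1}^{n} \frac{1}{k^2} \leq \frac{\pi^2}{6}.$$

By the $L^p$-convergence theorem (see for instance Thm. 5.4.5 in [3]) there exists
a random variable $X$ such that $X_n$ converges to $X$ almost surely and in $L^2$.
Having finite variance, $X$ must be a.s. real-valued and due to

$$\sum_{k=1}^{n} \frac{Y_k}{f(k)} - X_n = \varepsilon \cdot \sum_{k=1}^{n} \frac{1}{f(k)},$$

the divergence of $\sum_{k=1}^{\infty} \frac{1}{f(k)}$ forces $\sum_{k=1}^{\infty} \frac{Y_k}{f(k)} = \infty$ almost surely.

□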
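The decomposition used in this proof, a deterministic drift $\varepsilon \sum 1/f(k)$ plus a martingale part with bounded second moment, can be illustrated by simulation. The sketch below is only an illustration under assumed parameters ($f(k) = k$, $\varepsilon = 0.1$, an arbitrary seed); the function name `weighted_bernoulli_sum` is hypothetical.

```python
import random

def weighted_bernoulli_sum(f, eps, n, rng):
    """Return (sum_{k<=n} Y_k/f(k), eps * sum_{k<=n} 1/f(k)) for Y_k i.i.d. Ber(eps)."""
    total, drift = 0.0, 0.0
    for k in range(1, n + 1):
        y = 1 if rng.random() < eps else 0   # one sample of Y_k ~ Ber(eps)
        total += y / f(k)
        drift += eps / f(k)
    return total, drift

if __name__ == "__main__":
    rng = random.Random(0)                   # fixed seed, chosen arbitrarily
    for n in (10**2, 10**4, 10**5):
        s, d = weighted_bernoulli_sum(lambda k: k, 0.1, n, rng)
        # The difference s - d is a sample of the martingale X_n.
        print(n, round(s, 3), round(d, 3), round(s - d, 3))
```

In such runs the drift column grows without bound while the difference stays of moderate size, which is the mechanism behind the lemma.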


Proof of Theorem 5.1: Given a graph $G$ with the properties stated and a
vertex $v$, we can choose a neighbor-rich half-line $H$ with $v = v_1$ and the set
of extra neighbors $\{u_n\}_{n \in \mathbb{N}}$ described in and after Definition 7. The initial
water levels at $\{u_n\}_{n \in \mathbb{N}}$ are i.i.d. unif([0,1]), of course.

Depending on the random initial profile, let us define the following
SAD-procedure starting at $v$: Fix $\varepsilon, \delta > 0$ and let $(N_l)_{l \in \mathbb{N}}$ be the increasing (random)
sequence of indices chosen such that the initial water level at $u_{N_l}$ is at least $1 - \varepsilon$
for all $l$. Then define the SAD-procedure - starting with $\xi_0(v) = 1$, $\xi_0(u) = 0$ for
all $u \neq v$ - such that first all vertices along the line $(v_1, v_2, \dots, v_{f(N_1)}, u_{N_1})$
exchange liquids sufficiently often to get

$$\xi_{k_1}(u_{N_1}) \geq \frac{1}{f(N_1) + 2}$$

and never touch $u_{N_1}$ again. Note that by Lemma 2.2, $\xi_k(u_{N_1})$ can be pushed as
close to $\frac{1}{f(N_1) + 1}$ as desired in this way. At time $k_1$, the joint amount of water in
the glasses at $v_1, v_2, \dots, v_{f(N_1)}$ equals $1 - \xi_{k_1}(u_{N_1})$ and we will repeat the same
procedure along $(v_1, v_2, \dots, v_{f(N_2)}, u_{N_2})$ to get

$$\xi_{k_2}(u_{N_2}) \geq \frac{1 - \xi_{k_1}(u_{N_1})}{f(N_2) + 2} \quad \text{for some } k_2 > k_1$$

and iterate this.
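A single round of this procedure can be mimicked numerically. The sketch below assumes one particular schedule (repeated left-to-right sweeps of complete pairwise averaging along a path); this schedule and the name `sweep_average` are assumptions made for illustration and need not coincide with the construction behind Lemma 2.2. It shows the share in the last glass approaching the reciprocal of the number of vertices on the path.

```python
def sweep_average(profile, sweeps):
    """Repeatedly average neighboring glasses along a path, left to right."""
    levels = list(profile)
    for _ in range(sweeps):
        for i in range(len(levels) - 1):
            mean = (levels[i] + levels[i + 1]) / 2.0
            levels[i] = levels[i + 1] = mean   # both glasses end at the mean
    return levels

if __name__ == "__main__":
    path_length = 6                            # hypothetical: 5 line vertices plus one extra neighbor
    start = [1.0] + [0.0] * (path_length - 1)  # all water initially at v_1
    for sweeps in (1, 5, 50):
        print(sweeps, round(sweep_average(start, sweeps)[-1], 6))
```

With six vertices on the path the last entry tends to 1/6, so after sufficiently many sweeps it exceeds the threshold 1/7 required in the first round.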

After $m$ iterations of this kind, the joint amount of water localized at vertices
of the half-line $H$ equals $1 - \sum_{l=1}^{m} \xi_{k_l}(u_{N_l})$, which using $1 - x \leq e^{-x}$ can be
bounded from above as follows:

$$1 - \sum_{l=1}^{m} \xi_{k_l}(u_{N_l}) \leq \prod_{l=1}^{m} \Big(1 - \frac{1}{f(N_l) + 2}\Big) \leq \exp\Big(-\sum_{l=1}^{m} \frac{1}{f(N_l) + 2}\Big). \tag{13}$$
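The first inequality in (13) follows by induction over the rounds. As a sketch, write $R_l$ for the amount of water left on $H$ after round $l$; the symbol $R_l$ is introduced here for illustration only and does not appear in the original text:

```latex
\[
  R_0 = 1, \qquad R_l := 1 - \sum_{j=1}^{l} \xi_{k_j}(u_{N_j}) .
\]
% Round l spreads the remaining amount R_{l-1} along a path of
% f(N_l) + 1 vertices, so the share moved to u_{N_l} can be made
% at least R_{l-1} / (f(N_l) + 2). Therefore
\[
  R_l = R_{l-1} - \xi_{k_l}(u_{N_l})
      \le R_{l-1} \Bigl( 1 - \frac{1}{f(N_l) + 2} \Bigr),
  \qquad\text{hence}\qquad
  R_m \le \prod_{l=1}^{m} \Bigl( 1 - \frac{1}{f(N_l) + 2} \Bigr) .
\]
```

The second inequality is then the bound $1 - x \leq e^{-x}$ applied to each factor.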


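Defining $Y_k := \mathbb{1}_{\{\eta_0(u_k) \geq 1 - \varepsilon\}}$ for all $k \in \mathbb{N}$ we get $(Y_k)_{k \in \mathbb{N}}$ i.i.d. Ber($\varepsilon$) and can
rewrite the limit of the sum in the exponent as follows:

$$\sum_{l=1}^{\infty} \frac{1}{f(N_l) + 2} = \sum_{k=1}^{\infty} \frac{Y_k}{f(k) + 2}.$$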


This allows us to conclude from Lemma 5.2 that the exponent in (13) tends a.s.
to $-\infty$ as $m \to \infty$. Consequently, $m, T \in \mathbb{N}$ can be chosen large enough such
that with probability at least $1 - \delta$ it holds that

$$\sum_{l=1}^{m} \xi_{k_l}(u_{N_l}) \geq 1 - \varepsilon \quad \text{and} \quad k_m \leq T.$$

Given this event, the move sequence corresponding to the SAD-procedure
just described - adding no further updates after time $k_m$, i.e. $\mu_k = 0$ for $k > k_m$,
if $k_m < T$ - then ensures (see Lemma 2.1) that

$$\eta_T(v) \geq \sum_{l=1}^{m} \xi_T(u_{N_l})\, \eta_0(u_{N_l}) \geq (1 - \varepsilon)^2,$$

forcing $\kappa(v) \geq (1 - \varepsilon)^2$ with probability at least $1 - \delta$. Since $\delta > 0$ was arbitrary,
this implies $\kappa(v) \geq (1 - \varepsilon)^2$ a.s. and letting $\varepsilon$ go to 0 then establishes the claim.

□ 

Let us now take a look at how this result can be used to crystallize the
outstanding leanness of the infinite line among all infinite quasi-transitive graphs.
To this end, let us first repeat the definition of quasi-transitivity. 

Definition 8 

Let $G = (V, E)$ be a simple graph. A bijection $f : V \to V$ with the property
that $(f(u), f(v)) \in E$ if and only if $(u, v) \in E$ is called a graph automorphism.
$G$ is said to be (vertex-) transitive if for any two vertices $u, v \in V$ there exists
a graph automorphism $f$ that maps $u$ on $v$, i.e. $f(u) = v$.

If the vertex set $V$ can be partitioned into finitely many classes such that
for any two vertices $u, v$ belonging to the same class there exists a graph
automorphism that maps $u$ on $v$, the graph $G$ is called quasi-transitive.

Note that the notion of quasi-transitivity becomes meaningful only for
infinite graphs as all finite graphs are quasi-transitive by definition.

Theorem 5.3 

Consider an infinite (connected) quasi-transitive graph $G = (V, E)$ and the
initial water levels to be i.i.d. unif([0,1]). Let $v \in V$ be a fixed vertex of the graph.
If $G$ is the line graph, that is $V = \mathbb{Z}$ and $E = \{(u, u + 1),\ u \in \mathbb{Z}\}$, then $\kappa(v)$
depends on the initial profile. If $G$ is not the line graph, then $\kappa(v) = 1$ almost
surely.

Proof: Given i.i.d. unif([0, 1]) initial water levels, we can immediately conclude
two things: If $G$ is an infinite (connected) graph, the strong law of large numbers
guarantees $\kappa(v) \geq \frac{1}{2}$ almost surely.

If $G$ is the infinite line graph, there is a positive probability that the vertex
$v$ is what Häggström [4] calls two-sidedly $\varepsilon$-flat with respect to the initial profile
(see La. 4.3 in [4]), i.e.

$$\frac{1}{m + n + 1} \sum_{u = v - m}^{v + n} \eta_0(u) \in \Big[\tfrac{1}{2} - \varepsilon,\ \tfrac{1}{2} + \varepsilon\Big] \quad \text{for all } m, n \in \mathbb{N}_0. \tag{14}$$

La. 6.3 in [4] states that in this situation, the water level at $v$ is bound to stay
within the interval $[\tfrac{1}{2} - 6\varepsilon,\ \tfrac{1}{2} + 6\varepsilon]$ irrespective of future updates. Together with
the simple observation $\kappa(v) \geq \eta_0(v)$, it implies that $\kappa(v)$ is a random variable
with non-degenerate distribution on $[\tfrac{1}{2}, 1]$.
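Condition (14) involves infinitely many windows, but its restriction to a finite range is easy to test on simulated profiles. The sketch below checks all windows with $0 \leq m, n \leq$ `reach` around a position of a finite list; this is only a necessary condition for two-sided flatness, and the function name `is_flat_on_window` as well as the truncation parameter `reach` are assumptions made for illustration.

```python
from itertools import accumulate

def is_flat_on_window(levels, v, eps, reach):
    """Check (14) for all 0 <= m, n <= reach around position v of a finite list."""
    if v - reach < 0 or v + reach >= len(levels):
        raise ValueError("window exceeds the available levels")
    prefix = [0.0] + list(accumulate(levels))   # prefix[i] = sum(levels[:i])
    for m in range(reach + 1):
        for n in range(reach + 1):
            total = prefix[v + n + 1] - prefix[v - m]   # sum over v-m, ..., v+n
            if abs(total / (m + n + 1) - 0.5) > eps:
                return False
    return True
```

Using prefix sums keeps each window average at constant cost, so the check is quadratic in `reach`.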

In view of Theorem 5.1, to prove the second part, we only have to verify that
an infinite, connected, quasi-transitive graph that is not the line graph contains
a neighbor-rich half-line. Since $G$ is infinite (and by our general assumptions
both connected and having finite maximal degree) a compactness argument
guarantees the existence of a half-line $H$ on the vertices $\{v_k,\ k \in \mathbb{N}\}$ such that
$v_1 = v$ and the graph distance from $v_k$ to $v$ is $k - 1$ for all $k$.

Let us consider the function $d : V \to \mathbb{N}_0$, where $d(u)$ is the graph distance
from the node $u$ to a vertex of degree at least 3 being closest to it. Since $G$ is
quasi-transitive, connected and not the line graph, $d$ is finite and can take on
only finitely many values, which is why it has to be bounded, by $C \in \mathbb{N}$ say.
Consequently, $G$ can not contain stretches of more than $2C$ linked vertices of
degree 2. For this reason, there must be a vertex among $v_3, \dots, v_{2C+3}$, $v_{f(1)}$ say,
having a neighbor $u_1$ outside of $H$. In the same way, we can find a vertex $u_2$
outside $H$ having a neighbor $v_{f(2)}$ among $v_{2C+6}, \dots, v_{4C+6}$ and in general some
$u_k$ not part of $H$ but linked to a vertex $v_{f(k)} \in \{v_k,\ k \in \mathbb{N}\}$ with

$$(k - 1)(2C + 3) + 3 \leq f(k) \leq k(2C + 3) \quad \text{for all } k \in \mathbb{N}.$$

This choice makes sure that $v_{f(j)}$ and $v_{f(k)}$ are at graph distance at least 3 for
$j \neq k$, which forces the set $\{u_k,\ k \in \mathbb{N}\}$ to consist of distinct vertices. Due to

$$\sum_{k=1}^{\infty} \frac{1}{f(k)} \geq \frac{1}{2C + 3} \sum_{k=1}^{\infty} \frac{1}{k} = \infty,$$

$H$ is a neighbor-rich half-line in the sense of Definition 7 as desired. □
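The bookkeeping in this construction can be verified mechanically. The following sketch computes the index window allowed for $f(k)$ and a lower bound on the partial sums of $1/f(k)$; the helper names `index_window` and `harmonic_lower_bound` are hypothetical and serve only to illustrate the two properties used above (consecutive windows are separated by a gap of 3, and $f(k) \leq k(2C + 3)$).

```python
def index_window(k, C):
    """Return the inclusive range (lo, hi) allowed for f(k), given the bound C on d."""
    lo = (k - 1) * (2 * C + 3) + 3
    hi = k * (2 * C + 3)
    return lo, hi

def harmonic_lower_bound(n, C):
    """Lower bound on sum_{k<=n} 1/f(k), using f(k) <= k * (2C + 3)."""
    return sum(1.0 / (k * (2 * C + 3)) for k in range(1, n + 1))

if __name__ == "__main__":
    C = 2                                   # hypothetical bound on d
    for k in (1, 2, 3):
        print(k, index_window(k, C))        # windows of 2C + 1 consecutive indices
    for n in (10, 1_000, 100_000):
        print(n, round(harmonic_lower_bound(n, C), 3))
```

Each window contains $2C + 1$ consecutive indices, which is exactly the length of a stretch that must contain a vertex of degree at least 3.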

Remark 

(a) Note that the essential property of the initial water levels, needed in the
proof of Theorem 5.1, was independence. The argument can immediately be
generalized to the situation where the initial water levels are independently
(but not necessarily identically) distributed on $[0, C]$ and we have some weak
form of uniformity, namely:

For every $\delta > 0$, there exists some $\varepsilon > 0$ such that for all $v \in V$:

$$\mathbb{P}\big(\eta_0(v) > C - \delta\big) \geq \varepsilon.$$

The sequence $Y_k := \mathbb{1}_{\{\eta_0(u_k) > C - \delta\}}$, $k \in \mathbb{N}$, similar to the one defined in
the proof of Theorem 5.1 will no longer be i.i.d. Ber($\varepsilon$), but an appropriate
coupling will ensure that

$$Y_k \geq Z_k \quad \text{for all } k \in \mathbb{N}$$

almost surely, where $(Z_k)_{k \in \mathbb{N}}$ is an i.i.d. sequence of Ber($\varepsilon$) random variables.
Accordingly, we get $\kappa(v) = C$ a.s. even in this generalized setting.

(b) As alluded to in the introduction, the statement of Theorem 5.3 can be
interpreted in the following way: When it comes to the qualitative behavior
of $\kappa(v)$ for a fixed vertex $v$ in the graph, the radical change does not happen
between finite and infinite graphs but rather between the line graph $\mathbb{Z}$ and
all other quasi-transitive infinite graphs, which is why the results for the
Deffuant model on $\mathbb{Z}$ can not immediately be transferred to higher-dimensional
grids - as discussed in the introduction of Sect. 3 in [5].

(c) Finally, it is worth emphasizing that Theorem 5.3 does not capture the full
statement of Theorem 5.1: If we take the infinite line graph $\mathbb{Z}$ and add an
extra neighbor to every node that corresponds to a prime number, the only
quasi-transitive subgraph contained is the line graph itself. However, since
it contains a neighbor-rich half-line, Theorem 5.1 states that $\kappa(v) = 1$ almost surely for
i.i.d. unif([0,1]) initial water levels and any target vertex $v$.
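The graph in (c) is neighbor-rich because the sum of the reciprocals of the primes diverges (a classical result going back to Euler), so one may take $f(k)$ to be the $k$-th prime. The sketch below computes partial sums of this series; identifying the half-line vertex $v_k$ with the integer $k$ and the helper names `primes_up_to` and `reciprocal_prime_sum` are assumptions made for illustration. The growth is extremely slow, so the numbers only hint at the divergence.

```python
import math

def primes_up_to(n):
    """Sieve of Eratosthenes: return all primes p <= n."""
    if n < 2:
        return []
    is_prime = bytearray([1]) * (n + 1)
    is_prime[0] = is_prime[1] = 0
    for p in range(2, math.isqrt(n) + 1):
        if is_prime[p]:
            # Cross out the multiples p*p, p*p + p, ... up to n.
            is_prime[p * p :: p] = bytearray(len(range(p * p, n + 1, p)))
    return [i for i in range(n + 1) if is_prime[i]]

def reciprocal_prime_sum(n):
    """Return the sum of 1/p over all primes p <= n."""
    return sum(1.0 / p for p in primes_up_to(n))

if __name__ == "__main__":
    for n in (10**2, 10**4, 10**6):
        print(n, round(reciprocal_prime_sum(n), 4))
```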